Abstract Sim2Real through Approximate Information States

April 16, 20262604.15289

cs.RO

TLDR

This paper formalizes abstract sim2real, proposing a method to correct abstract simulator dynamics using real-world data for successful policy transfer.

Key contributions

Formalizes the "abstract sim2real" problem using state abstraction from RL literature.
Shows abstract simulator dynamics require state history to be grounded to target tasks.
Proposes a method to correct abstract simulator dynamics using real-world task data.
Achieves successful policy transfer from abstract sim to real-world environments.

Why it matters

This paper addresses the critical challenge of deploying RL policies in complex real-world environments where perfect simulators are impractical. It provides a principled framework and a practical method to leverage imperfect, abstract simulators for robust sim2real transfer. This enables more scalable and effective robotic applications.

Original Abstract

In recent years, reinforcement learning (RL) has shown remarkable success in robotics when a fast and accurate simulator is available for a given task. When using RL and simulation, more simulator realism is generally beneficial but becomes harder to obtain as robots are deployed in increasingly complex and widescale domains. In such settings, simulators will likely fail to model all relevant details of a given target task and this observation motivates the study of sim2real with simulators that leave out key task details. In this paper, we formalize and study the abstract sim2real problem: given an abstract simulator that models a target task at a coarse level of abstraction, how can we train a policy with RL in the abstract simulator and successfully transfer it to the real-world? Our first contribution is to formalize this problem using the language of state abstraction from the RL literature. This framing shows that an abstract simulator can be grounded to match the target task if the grounded abstract dynamics take the history of states into account. Based on the formalism, we then introduce a method that uses real-world task data to correct the dynamics of the abstract simulator. We then show that this method enables successful policy transfer both in sim2sim and sim2real evaluation.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers