A Mechanistic Analysis of Sim-and-Real Co-Training in Generative Robot Policies

April 15, 20262604.13645

Yu Lei, Minghuan Liu, Abhiram Maddukuri, Zhenyu Jiang, Yuke Zhu

cs.ROcs.AIcs.LG

TLDR

This paper mechanistically analyzes sim-and-real co-training in robot policies, identifying two key effects: representation alignment and importance reweighting.

Key contributions

Analyzes sim-and-real co-training in robot policies, identifying its underlying mechanisms.
Introduces "structured representation alignment" as a primary effect balancing cross-domain alignment and discernibility.
Reveals the "importance reweighting effect" as a secondary mechanism from domain-dependent action weighting.
Validates findings with controlled experiments and robot manipulation tasks, improving prior methods.

Why it matters

Co-training is crucial for generative robot policies but its underlying mechanisms were unclear. This paper provides a theoretical and empirical framework to understand its effectiveness, identifying two key effects. This deeper insight can lead to more robust and efficient co-training methods for robotics.

Original Abstract

Co-training, which combines limited in-domain real-world data with abundant surrogate data such as simulation or cross-embodiment robot data, is widely used for training generative robot policies. Despite its empirical success, the mechanisms that determine when and why co-training is effective remain poorly understood. We investigate the mechanism of sim-and-real co-training through theoretical analysis and empirical study, and identify two intrinsic effects governing performance. The first, \textbf{``structured representation alignment"}, reflects a balance between cross-domain representation alignment and domain discernibility, and plays a primary role in downstream performance. The second, the \textbf{``importance reweighting effect"}, arises from domain-dependent modulation of action weighting and operates at a secondary level. We validate these effects with controlled experiments on a toy model and extensive sim-and-sim and sim-and-real robot manipulation experiments. Our analysis offers a unified interpretation of recent co-training techniques and motivates a simple method that consistently improves upon prior approaches. More broadly, our aim is to examine the inner workings of co-training and to facilitate research in this direction.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers