A Mechanistic Analysis of Sim-and-Real Co-Training in Generative Robot Policies
Yu Lei, Minghuan Liu, Abhiram Maddukuri, Zhenyu Jiang, Yuke Zhu
TLDR
This paper mechanistically analyzes sim-and-real co-training in robot policies, identifying two key effects: representation alignment and importance reweighting.
Key contributions
- Analyzes sim-and-real co-training in robot policies, identifying its underlying mechanisms.
- Introduces "structured representation alignment" as a primary effect balancing cross-domain alignment and discernibility.
- Reveals the "importance reweighting effect" as a secondary mechanism from domain-dependent action weighting.
- Validates findings with controlled experiments and robot manipulation tasks, improving prior methods.
Why it matters
Co-training is crucial for generative robot policies but its underlying mechanisms were unclear. This paper provides a theoretical and empirical framework to understand its effectiveness, identifying two key effects. This deeper insight can lead to more robust and efficient co-training methods for robotics.
Original Abstract
Co-training, which combines limited in-domain real-world data with abundant surrogate data such as simulation or cross-embodiment robot data, is widely used for training generative robot policies. Despite its empirical success, the mechanisms that determine when and why co-training is effective remain poorly understood. We investigate the mechanism of sim-and-real co-training through theoretical analysis and empirical study, and identify two intrinsic effects governing performance. The first, \textbf{``structured representation alignment"}, reflects a balance between cross-domain representation alignment and domain discernibility, and plays a primary role in downstream performance. The second, the \textbf{``importance reweighting effect"}, arises from domain-dependent modulation of action weighting and operates at a secondary level. We validate these effects with controlled experiments on a toy model and extensive sim-and-sim and sim-and-real robot manipulation experiments. Our analysis offers a unified interpretation of recent co-training techniques and motivates a simple method that consistently improves upon prior approaches. More broadly, our aim is to examine the inner workings of co-training and to facilitate research in this direction.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.