From Seeing to Simulating: Generative High-Fidelity Simulation with Digital Cousins for Generalizable Robot Learning and Evaluation
Jasper Lu, Zhenhao Shen, Yuanfei Wang, Shugao Liu, Shengqiang Xu + 6 more
TLDR
This paper introduces Digital Cousins, a generative framework converting real-world panoramas into high-fidelity simulations for robust robot learning.
Key contributions
- Generative framework maps real-world panoramas to high-fidelity simulation scenes.
- Synthesizes diverse "cousin scenes" via semantic and geometric editing for robust data augmentation.
- Supports interactive manipulation tasks and long-horizon navigation in stitched large-scale environments.
- Demonstrates strong sim-to-real correlation, significantly improving robot generalization to novel variations.
Why it matters
Real-world robot data collection is expensive and hard to scale. This work offers a novel way to generate diverse, high-fidelity simulation data from real scenes, significantly reducing costs. It enables more robust and generalizable robot learning by overcoming data scarcity.
Original Abstract
Learning robust robot policies in real-world environments requires diverse data augmentation, yet scaling real-world data collection is costly due to the need for acquiring physical assets and reconfiguring environments. Therefore, augmenting real-world scenes into simulation has become a practical augmentation for efficient learning and evaluation. We present a generative framework that establishes a generative real-to-sim mapping from real-world panoramas to high-fidelity simulation scenes, and further synthesize diverse cousin scenes via semantic and geometric editing. Combined with high-quality physics engines and realistic assets, the generated scenes support interactive manipulation tasks. Additionally, we incorporate multi-room stitching to construct consistent large-scale environments for long-horizon navigation across complex layouts. Experiments demonstrate a strong sim-to-real correlation validating our platform's fidelity, and show that extensively scaling up data generation leads to significantly better generalization to unseen scene and object variations, demonstrating the effectiveness of Digital Cousins for generalizable robot learning and evaluation.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.