GS-Playground: A High-Throughput Photorealistic Simulator for Vision-Informed Robot Learning
Yufei Jia, Heng Zhang, Ziheng Zhang, Junzhe Wu, Mingrui Yu + 37 more
TLDR
GS-Playground is a high-throughput photorealistic simulator for vision-informed robot learning, integrating parallel physics with 3D Gaussian Splatting.
Key contributions
- Achieves 10^4 FPS at 640x480, enabling large-scale visual reinforcement learning.
- Integrates a novel parallel physics engine with batch 3D Gaussian Splatting rendering.
- Automated Real2Sim workflow reconstructs photorealistic, physically consistent environments.
- Effectively bridges perceptual and physical sim-to-real gaps for diverse embodied tasks.
Why it matters
This paper addresses the computational overhead of photorealistic rendering and the manual effort of 3D asset creation in embodied AI. By providing a high-throughput simulator and an automated Real2Sim workflow, GS-Playground significantly accelerates vision-centric robot learning and improves sim-to-real transfer.
Original Abstract
Embodied AI research is undergoing a shift toward vision-centric perceptual paradigms. While massively parallel simulators have catalyzed breakthroughs in proprioception-based locomotion, their potential remains largely untapped for vision-informed tasks due to the prohibitive computational overhead of large-scale photorealistic rendering. Furthermore, the creation of simulation-ready 3D assets heavily relies on labor-intensive manual modeling, while the significant sim-to-real physical gap hinders the transfer of contact-rich manipulation policies. To address these bottlenecks, we propose GS-Playground, a multi-modal simulation framework designed to accelerate end-to-end perceptual learning. We develop a novel high-performance parallel physics engine, specifically designed to integrate with a batch 3D Gaussian Splatting (3DGS) rendering pipeline to ensure high-fidelity synchronization. Our system achieves a breakthrough throughput of 10^4 FPS at 640x480 resolution, significantly lowering the barrier for large-scale visual RL. Additionally, we introduce an automated Real2Sim workflow that reconstructs photorealistic, physically consistent, and memory-efficient environments, streamlining the generation of complex simulation-ready scenes. Extensive experiments on locomotion, navigation, and manipulation demonstrate that GS-Playground effectively bridges the perceptual and physical gaps across diverse embodied tasks. Project homepage: https://gsplayground.github.io.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.