ArXiv TLDR

PhysInOne: Visual Physics Learning and Reasoning in One Suite

🐦 Tweet
2604.09415

Siyuan Zhou, Hejun Wang, Hu Cheng, Jinxi Li, Dongsheng Wang + 34 more

cs.CVcs.AIcs.LGcs.RO

TLDR

PhysInOne is a new large-scale dataset with 2 million videos and detailed annotations for training AI in physics-grounded visual reasoning.

Key contributions

  • Introduces PhysInOne, a large-scale synthetic dataset with 2 million videos across 153,810 dynamic 3D scenes.
  • Covers 71 physical phenomena with multi-object interactions, complex backgrounds, and comprehensive ground-truth.
  • Enables physics-aware video generation, future frame prediction, property estimation, and motion transfer.
  • Significantly enhances physical plausibility in foundation models and establishes a new benchmark.

Why it matters

This paper introduces PhysInOne, a massive dataset that addresses the critical lack of physically-grounded data for AI. It provides a new benchmark for developing more physically plausible world models. This is crucial for advancing AI in areas like generation, simulation, and embodied AI.

Original Abstract

We present PhysInOne, a large-scale synthetic dataset addressing the critical scarcity of physically-grounded training data for AI systems. Unlike existing datasets limited to merely hundreds or thousands of examples, PhysInOne provides 2 million videos across 153,810 dynamic 3D scenes, covering 71 basic physical phenomena in mechanics, optics, fluid dynamics, and magnetism. Distinct from previous works, our scenes feature multiobject interactions against complex backgrounds, with comprehensive ground-truth annotations including 3D geometry, semantics, dynamic motion, physical properties, and text descriptions. We demonstrate PhysInOne's efficacy across four emerging applications: physics-aware video generation, long-/short-term future frame prediction, physical property estimation, and motion transfer. Experiments show that fine-tuning foundation models on PhysInOne significantly enhances physical plausibility, while also exposing critical gaps in modeling complex physical dynamics and estimating intrinsic properties. As the largest dataset of its kind, orders of magnitude beyond prior works, PhysInOne establishes a new benchmark for advancing physics-grounded world models in generation, simulation, and embodied AI.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.