PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World
Yunhan Yang, Chunshi Wang, Junliang Ye, Yang Li, Zanxin Chen + 5 more
TLDR
PhysForge generates physics-grounded 3D assets for interactive virtual worlds and embodied AI using a two-stage framework and a large-scale physical asset dataset.
Key contributions
- Introduces PhysForge, a two-stage framework for generating physics-grounded 3D assets.
- Utilizes PhysDB, a large dataset of 150,000 assets with four-tier physical annotations.
- A VLM plans a "Hierarchical Physical Blueprint" defining material, functional, and kinematic constraints.
- A physics-grounded diffusion model synthesizes geometry and kinematic parameters via KineVoxel Injection (KVI).
Why it matters
This paper addresses the bottleneck of creating interactive 3D assets for virtual worlds and embodied AI. PhysForge generates functionally plausible, simulation-ready assets, significantly advancing the development of interactive 3D content and intelligent agents. It provides a robust data engine for future research.
Original Abstract
Synthesizing physics-grounded 3D assets is a critical bottleneck for interactive virtual worlds and embodied AI. Existing methods predominantly focus on static geometry, overlooking the functional properties essential for interaction. We propose that interactive asset generation must be rooted in functional logic and hierarchical physics. To bridge this gap, we introduce PhysForge, a decoupled two-stage framework supported by PhysDB, a large-scale dataset of 150,000 assets with four-tier physical annotations. First, a VLM acts as a "physical architect" to plan a "Hierarchical Physical Blueprint" defining material, functional, and kinematic constraints. Second, a physics-grounded diffusion model realizes this blueprint by synthesizing high-fidelity geometry alongside precise kinematic parameters via a novel KineVoxel Injection (KVI) mechanism. Experiments demonstrate that PhysForge produces functionally plausible, simulation-ready assets, providing a robust data engine for interactive 3D content and embodied agents.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.