ArXiv TLDR

GeoRect4D: Geometry-Compatible Generative Rectification for Dynamic Sparse-View 3D Reconstruction

🐦 Tweet
2604.20784

Zhenlong Wu, Zihan Zheng, Xuanxuan Wang, Qianhe Wang, Hua Yang + 3 more

cs.CV

TLDR

GeoRect4D improves dynamic sparse-view 3D reconstruction by coupling explicit 3D consistency with generative refinement via a closed-loop optimization.

Key contributions

  • Employs a degradation-aware feedback loop combining dynamic 3DGS and a single-step diffusion rectifier.
  • Uses a structural locking mechanism and spatiotemporal attention for physical plausibility during generation.
  • Implements a progressive optimization with stochastic geometric purification and generative distillation.
  • Achieves state-of-the-art fidelity, perceptual quality, and spatiotemporal consistency on datasets.

Why it matters

This paper tackles the challenging problem of dynamic 3D scene reconstruction from sparse views, which often suffers from artifacts and inconsistencies. GeoRect4D effectively resolves the mismatch between generative priors and 3D geometry, leading to more robust and high-fidelity results. It offers a significant advancement for creating realistic dynamic 3D content.

Original Abstract

Reconstructing dynamic 3D scenes from sparse multi-view videos is highly ill-posed, often leading to geometric collapse, trajectory drift, and floating artifacts. Recent attempts introduce generative priors to hallucinate missing content, yet naive integration frequently causes structural drift and temporal inconsistency due to the mismatch between stochastic 2D generation and deterministic 3D geometry. In this paper, we propose GeoRect4D, a novel unified framework for sparse-view dynamic reconstruction that couples explicit 3D consistency with generative refinement via a closed-loop optimization process. Specifically, GeoRect4D introduces a degradation-aware feedback mechanism that incorporates a robust anchor-based dynamic 3DGS substrate with a single-step diffusion rectifier to hallucinate high-fidelity details. This rectifier utilizes a structural locking mechanism and spatiotemporal coordinated attention, effectively preserving physical plausibility while restoring missing content. Furthermore, we present a progressive optimization strategy that employs stochastic geometric purification to eliminate floaters and generative distillation to infuse texture details into the explicit representation. Extensive experiments demonstrate that GeoRect4D achieves state-of-the-art performance in reconstruction fidelity, perceptual quality, and spatiotemporal consistency across multiple datasets.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.