Vista4D: Video Reshooting with 4D Point Clouds

April 23, 20262604.21915

Kuan Heng Lin, Zhizheng Liu, Pablo Salamanca, Yash Kant, Ryan Burgert + 7 more

cs.CV

TLDR

Vista4D introduces a novel video reshooting framework that uses 4D point clouds to re-synthesize dynamic scenes with improved consistency and camera control.

Key contributions

Uses a 4D point cloud to ground input video and target cameras for robust video reshooting.
Employs static pixel segmentation and 4D reconstruction to preserve content and provide rich camera signals.
Achieves improved 4D consistency, camera control, and visual quality over state-of-the-art methods.
Generalizes to real-world applications such as dynamic scene expansion and 4D scene recomposition.

Why it matters

Video reshooting often fails with depth artifacts and poor camera control. Vista4D uses a 4D point cloud for robust, high-quality resynthesis from new viewpoints, advancing creative video production.

Original Abstract

We present Vista4D, a robust and flexible video reshooting framework that grounds the input video and target cameras in a 4D point cloud. Specifically, given an input video, our method re-synthesizes the scene with the same dynamics from a different camera trajectory and viewpoint. Existing video reshooting methods often struggle with depth estimation artifacts of real-world dynamic videos, while also failing to preserve content appearance and failing to maintain precise camera control for challenging new trajectories. We build a 4D-grounded point cloud representation with static pixel segmentation and 4D reconstruction to explicitly preserve seen content and provide rich camera signals, and we train with reconstructed multiview dynamic data for robustness against point cloud artifacts during real-world inference. Our results demonstrate improved 4D consistency, camera control, and visual quality compared to state-of-the-art baselines under a variety of videos and camera paths. Moreover, our method generalizes to real-world applications such as dynamic scene expansion and 4D scene recomposition. See our project page for results, code, and models: https://eyeline-labs.github.io/Vista4D

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers