ArXiv TLDR

Learning Long-term Motion Embeddings for Efficient Kinematics Generation

🐦 Tweet
2604.11737

Nick Stracke, Kolja Bauer, Stefan Andreas Baumann, Miguel Angel Bautista, Josh Susskind + 1 more

cs.CV

TLDR

This paper introduces a method for efficient long-term motion generation using highly compressed learned embeddings, outperforming current video models.

Key contributions

  • Introduces a long-term motion embedding for efficient scene dynamics modeling.
  • Achieves 64x temporal compression of motion data for efficiency.
  • Generates realistic, goal-directed motions via text prompts or spatial pokes.
  • Outperforms state-of-the-art video models and specialized task-specific methods.

Why it matters

This paper significantly improves the efficiency of generating long-term motions, a critical step for visual intelligence. By compressing motion data, it allows for faster exploration of future possibilities. This advancement could enable more practical applications in areas like robotics and animation.

Original Abstract

Understanding and predicting motion is a fundamental component of visual intelligence. Although modern video models exhibit strong comprehension of scene dynamics, exploring multiple possible futures through full video synthesis remains prohibitively inefficient. We model scene dynamics orders of magnitude more efficiently by directly operating on a long-term motion embedding that is learned from large-scale trajectories obtained from tracker models. This enables efficient generation of long, realistic motions that fulfill goals specified via text prompts or spatial pokes. To achieve this, we first learn a highly compressed motion embedding with a temporal compression factor of 64x. In this space, we train a conditional flow-matching model to generate motion latents conditioned on task descriptions. The resulting motion distributions outperform those of both state-of-the-art video models and specialized task-specific approaches.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.