ArXiv TLDR

BifrostUMI: Bridging Robot-Free Demonstrations and Humanoid Whole-Body Manipulation

🐦 Tweet
2605.03452

Chenhao Yu, Hongwu Wang, Youhao Hu, Jiachen Zhang, Yuanyuan Li + 1 more

cs.RO

TLDR

BifrostUMI enables robot-free data collection for humanoid robots using VR, training policies from human demonstrations for whole-body manipulation.

Key contributions

  • Robot-free data collection framework for humanoid whole-body manipulation.
  • Leverages lightweight VR to capture human demonstrations as sparse keypoint trajectories.
  • Trains high-level policies to predict future keypoints from captured visual features.
  • Robust keypoint retargeting pipeline for precise robot execution via whole-body control.

Why it matters

Current humanoid data collection is hindered by teleoperation limits. BifrostUMI offers an efficient, accessible solution, bridging natural human movements to robot control. This allows for training diverse and agile whole-body behaviors, overcoming a major bottleneck in humanoid policy learning.

Original Abstract

High-quality data collection is a fundamental cornerstone for training humanoid whole-body visuomotor policies. Current data acquisition paradigms predominantly rely on robot teleoperation, which is often hindered by limited hardware accessibility and low operational efficiency. Inspired by the Universal Manipulation Interface (UMI), we propose BifrostUMI, a portable, efficient, and robot-free data collection framework tailored for humanoid robots. BifrostUMI leverages lightweight VR devices to capture human demonstrations as sparse keypoint trajectories while simultaneously recording wrist-mounted visual data. These multimodal data are subsequently utilized to train a high-level policy network that predicts future keypoint trajectories conditioned on the captured visual features. Through a robust keypoint retargeting pipeline, keypoint trajectories are precisely mapped onto the robot's morphology and executed via a whole-body controller. This approach enables the seamless transfer of diverse and agile behaviors from natural human demonstrations to humanoid embodiments. We demonstrate the efficacy and versatility of the proposed framework across two distinct experimental scenarios.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.