Yifan Xie
3 papers ยท Latest:
Robotics
OA-WAM: Object-Addressable World Action Model for Robust Robot Manipulation
OA-WAM introduces an object-addressable world action model that decomposes scenes into persistent object slots for robust robot manipulation under scene shifts.
2605.06481
Artificial IntelligenceThinking in Text and Images: Interleaved Vision--Language Reasoning Traces for Long-Horizon Robot Manipulation
IVLR introduces an interleaved vision-language reasoning trace for long-horizon robot manipulation, achieving high success on complex tasks.
2605.00438
RoboticsLearning Human-Intention Priors from Large-Scale Human Demonstrations for Robotic Manipulation
MoT-HRA learns human-intention priors from 2.2M human video demonstrations to enable robust robotic manipulation through a hierarchical vision-language-action framework.
2604.24681
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.