Wenbo Ding

5 papers · Latest: May 7, 2026

OA-WAM: Object-Addressable World Action Model for Robust Robot Manipulation

OA-WAM introduces an object-addressable world action model that decomposes scenes into persistent object slots for robust robot manipulation under scene shifts.

2605.06481 · May 7, 2026

Artificial Intelligence

Thinking in Text and Images: Interleaved Vision-Language Reasoning Traces for Long-Horizon Robot Manipulation

IVLR introduces an interleaved vision-language reasoning trace for long-horizon robot manipulation, achieving high success on complex tasks.

2605.00438 · May 1, 2026

Robotics

Walk With Me: Long-Horizon Social Navigation for Human-Centric Outdoor Assistance

Walk With Me is a map-free framework that enables robots to perform safe, long-horizon social navigation outdoors from high-level human instructions.

2604.26839 · Apr 29, 2026

Robotics

Learning Human-Intention Priors from Large-Scale Human Demonstrations for Robotic Manipulation

MoT-HRA learns human-intention priors from 2.2M human video demonstrations to enable robust robotic manipulation through a hierarchical vision-language-action framework.

2604.24681 · Apr 27, 2026

Robotics

Agent-Centric Visual Reinforcement Learning under Dynamic Perturbations

ACO-MoE makes visual RL robust to dynamic perturbations by using agent-centric restoration experts, achieving near-clean performance on a new benchmark.

2604.24661 · Apr 27, 2026
