Xiaoshuai Hao
3 papers ยท Latest:
Artificial Intelligence
Thinking in Text and Images: Interleaved Vision--Language Reasoning Traces for Long-Horizon Robot Manipulation
IVLR introduces an interleaved vision-language reasoning trace for long-horizon robot manipulation, achieving high success on complex tasks.
2605.00438
RoboticsWalk With Me: Long-Horizon Social Navigation for Human-Centric Outdoor Assistance
Walk with Me is a map-free framework enabling robots to perform safe, long-horizon social navigation outdoors using high-level human instructions.
2604.26839
Computer VisionOneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation
OneVL introduces a unified VLA and World Model framework, achieving state-of-the-art latent Chain-of-Thought reasoning at real-time speed.
2604.18486
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.