Yizhou Wang
3 papers ยท Latest:
Robotics
GazeVLA: Learning Human Intention for Robotic Manipulation
GazeVLA uses human gaze as an intention proxy to bridge the human-robot embodiment gap, improving robotic manipulation with less robot data.
2604.22615
Computer VisionDistorted or Fabricated? A Survey on Hallucination in Video LLMs
This survey categorizes and analyzes hallucinations in Video LLMs, detailing their types, causes, evaluation, and mitigation strategies.
2604.12944
Computer VisionVisually-grounded Humanoid Agents
Visually-grounded Humanoid Agents enable autonomous digital humans to perceive, reason, and act in novel 3D environments using visual observations.
2604.08509
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.