YuAn Wang
5 papers · Latest:
Thinking in Text and Images: Interleaved Vision-Language Reasoning Traces for Long-Horizon Robot Manipulation
IVLR introduces an interleaved vision-language reasoning trace for long-horizon robot manipulation, achieving high success on complex tasks.
A Universal Dance of Galactic Disks: Ubiquitous Precession and Its Implications
Galactic disk precession is ubiquitous, driven by tidal torques, and significantly impacts galaxy evolution, including warps and satellite alignment.
How Code Representation Shapes False-Positive Dynamics in Cross-Language LLM Vulnerability Detection
Code representation significantly impacts false positives in cross-language LLM vulnerability detection, with text fine-tuning increasing FPR due to surface-cue memorization.
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents
GLM-5V-Turbo is a new foundation model integrating multimodal perception natively for enhanced agent reasoning, planning, and tool use across diverse contexts.
Learning Human-Intention Priors from Large-Scale Human Demonstrations for Robotic Manipulation
MoT-HRA learns human-intention priors from 2.2M human video demonstrations to enable robust robotic manipulation through a hierarchical vision-language-action framework.