Yaxuan Li
3 papers ยท Latest:
Robotics
dWorldEval: Scalable Robotic Policy Evaluation via Discrete Diffusion World Model
dWorldEval introduces a discrete diffusion world model for scalable robotic policy evaluation, unifying modalities and outperforming prior methods.
2604.22152
RoboticsHi-WM: Human-in-the-World-Model for Scalable Robot Post-Training
Hi-WM enables scalable robot post-training by allowing human intervention directly within a learned world model, reducing real-world execution needs.
2604.21741
Machine LearningRethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
This paper investigates on-policy distillation (OPD) dynamics in LLMs, identifying success conditions, token-level mechanisms, and practical recovery strategies.
2604.13016
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.