Yaxuan Li

3 papers · Latest: April 24, 2026

dWorldEval: Scalable Robotic Policy Evaluation via Discrete Diffusion World Model

dWorldEval introduces a discrete diffusion world model for scalable robotic policy evaluation, unifying modalities and outperforming prior methods.

2604.22152Apr 24, 2026

Robotics

Hi-WM: Human-in-the-World-Model for Scalable Robot Post-Training

Hi-WM enables scalable robot post-training by allowing human intervention directly within a learned world model, reducing real-world execution needs.

2604.21741Apr 23, 2026

Machine Learning

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

This paper investigates on-policy distillation (OPD) dynamics in LLMs, identifying success conditions, token-level mechanisms, and practical recovery strategies.

2604.13016Apr 14, 2026

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.