Zhiyuan Liu
3 papers · Latest:
Machine Learning
DECO: Sparse Mixture-of-Experts with Dense-Comparable Performance on End-Side Devices
DECO is a sparse MoE model matching dense performance on end-side devices, offering 3x speedup and reduced storage overhead.
2605.10933
Robotics
MISTY: High-Throughput Motion Planning via Mixer-based Single-step Drifting
MISTY is a high-throughput, single-step motion planner using a mixer-based architecture, achieving state-of-the-art performance and significant speedup.
2604.21489
Machine Learning
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
This paper investigates on-policy distillation (OPD) dynamics in LLMs, identifying success conditions, token-level mechanisms, and practical recovery strategies.
2604.13016