Mike Zheng Shou

4 papers · Latest: May 13, 2026

AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

AnyFlow introduces an any-step video diffusion model using flow map distillation, outperforming consistency-based methods and scaling with sampling steps.

2605.13724May 13, 2026

Robotics

World Action Models: The Next Frontier in Embodied AI

This survey introduces World Action Models (WAMs), a new embodied AI paradigm unifying predictive state modeling with action generation, providing a systematic overview.

2605.12090May 12, 2026

Computer Vision

Sparkle: Realizing Lively Instruction-Guided Video Background Replacement via Decoupled Guidance

Sparkle introduces a new dataset and benchmark for high-quality video background replacement, significantly improving model performance.

2605.06535May 7, 2026

Artificial Intelligence

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

This paper introduces a "levels x laws" taxonomy for agentic world models, synthesizing over 400 works and outlining a roadmap for future development.

2604.22748Apr 24, 2026

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.