Cong Huang

3 papers · Latest: May 13, 2026

FrameSkip: Learning from Fewer but More Informative Frames in VLA Training

FrameSkip improves VLA policy training by selecting fewer, more informative frames from robot demonstrations, boosting success rates.

2605.13757May 13, 2026

Computer Vision

EA-WM: Event-Aware Generative World Model with Structured Kinematic-to-Visual Action Fields

EA-WM is a generative world model that uses structured kinematic-to-visual action fields to improve robot interaction dynamics and geometry in generated videos.

2605.06192May 7, 2026

Robotics

STARRY: Spatial-Temporal Action-Centric World Modeling for Robotic Manipulation

STARRY is a novel world model for robotic manipulation that aligns spatial-temporal prediction with action generation for improved task success.

2604.26848Apr 29, 2026

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.