Cong Huang
3 papers · Latest:
Robotics
FrameSkip: Learning from Fewer but More Informative Frames in VLA Training
FrameSkip improves VLA policy training by selecting fewer, more informative frames from robot demonstrations, boosting success rates.
2605.13757
Computer Vision
EA-WM: Event-Aware Generative World Model with Structured Kinematic-to-Visual Action Fields
EA-WM is a generative world model that uses structured kinematic-to-visual action fields to improve robot interaction dynamics and geometry in generated videos.
2605.06192
Robotics
STARRY: Spatial-Temporal Action-Centric World Modeling for Robotic Manipulation
STARRY is a novel world model for robotic manipulation that aligns spatial-temporal prediction with action generation for improved task success.
2604.26848