Chunhua Shen
3 papers · Latest:
Computer Vision
MARBLE: Multi-Aspect Reward Balance for Diffusion RL
MARBLE introduces a gradient-space optimization framework to balance multiple rewards for diffusion RL, improving all dimensions simultaneously without manual weighting.
2605.06507
Computer Vision
Unlocking the Power of Critical Factors for 3D Visual Geometry Estimation
This paper investigates critical factors in 3D visual geometry estimation, revealing insights and introducing CARVE, a resolution-enhanced model for robust performance.
2604.21713
Computer Vision
MMControl: Unified Multi-Modal Control for Joint Audio-Video Generation
MMControl enables fine-grained multi-modal control for synchronized joint audio-video generation using a dual-stream diffusion transformer.
2604.19679