Canyu Zhao
2 papers ยท Latest:
Computer Vision
MARBLE: Multi-Aspect Reward Balance for Diffusion RL
MARBLE introduces a gradient-space optimization framework to balance multiple rewards for diffusion RL, improving all dimensions simultaneously without manual weighting.
2605.06507
Computer VisionMMControl: Unified Multi-Modal Control for Joint Audio-Video Generation
MMControl enables fine-grained multi-modal control for synchronized joint audio-video generation using a dual-stream diffusion transformer.
2604.19679
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.