Nan Duan
4 papers ยท Latest:
Computer Vision
OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation
OmniNFT proposes a novel diffusion RL framework to improve joint audio-video generation by addressing multi-modal challenges like gradient imbalance.
2605.12480
Machine LearningNear-Future Policy Optimization
NPO and AutoNPO enhance Reinforcement Learning with Verifiable Rewards (RLVR) by leveraging near-future policy checkpoints for improved off-policy learning.
2604.20733
Natural Language ProcessingOpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence
OpenSpatial is an open-source data engine and 3M-sample dataset that significantly improves spatial reasoning models, achieving SOTA performance.
2604.07296
Machine LearningSelf-Distilled RLVR
RLSD combines RLVR with self-distillation to provide fine-grained updates and reliable directions, improving LLM training stability and convergence.
2604.03128
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.