Nan Duan

4 papers · Latest: May 12, 2026

OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation

OmniNFT proposes a novel diffusion RL framework to improve joint audio-video generation by addressing multi-modal challenges like gradient imbalance.

2605.12480May 12, 2026

Machine Learning

Near-Future Policy Optimization

NPO and AutoNPO enhance Reinforcement Learning with Verifiable Rewards (RLVR) by leveraging near-future policy checkpoints for improved off-policy learning.

2604.20733Apr 22, 2026

Natural Language Processing

OpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence

OpenSpatial is an open-source data engine and 3M-sample dataset that significantly improves spatial reasoning models, achieving SOTA performance.

2604.07296Apr 8, 2026

Machine Learning

Self-Distilled RLVR

RLSD combines RLVR with self-distillation to provide fine-grained updates and reliable directions, improving LLM training stability and convergence.

2604.03128Apr 3, 2026

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.