Keming Wu
2 papers ยท Latest:
Computer Vision
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling
This paper proposes a new five-level taxonomy for visual generation, shifting from appearance synthesis to intelligent, agentic world modeling.
2604.28185
Computer VisionPRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning
PRISM introduces a black-box on-policy distillation stage to align large multimodal models, mitigating distributional drift between SFT and RLVR for improved performance.
2604.28123
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.