Wenxuan Huang
4 papers ยท Latest:
Computer Vision
Flow-OPD: On-Policy Distillation for Flow Matching Models
Flow-OPD introduces an on-policy distillation framework for Flow Matching text-to-image models, resolving multi-task alignment issues.
2605.08063
Computer VisionSCOPE: Structured Decomposition and Conditional Skill Orchestration for Complex Image Generation
SCOPE is a framework that uses structured decomposition and conditional skill orchestration to maintain semantic commitments for complex text-to-image generation.
2605.08043
Computer VisionOpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents
OpenSearch-VL provides an open-source recipe for training frontier multimodal deep search agents, achieving state-of-the-art performance.
2605.05185
Artificial IntelligenceAblateCell: A Reproduce-then-Ablate Agent for Virtual Cell Repositories
AblateCell is an AI agent that reproduces baselines and performs systematic ablations on virtual cell repositories to identify critical components.
2604.19606
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.