Yuwei Guo
2 papers ยท Latest:
Computer Vision
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors
UniVidX is a unified multimodal framework that leverages video diffusion priors for versatile video generation across diverse tasks with strong performance.
2605.00658
Computer VisionContext Unrolling in Omni Models
Omni is a unified multimodal model that uses 'Context Unrolling' to reason across diverse data types, improving performance and generation.
2604.21921
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.