Ming-Hsuan Yang
2 papers ยท Latest:
Computer Vision
AlbumFill: Album-Guided Reasoning and Retrieval for Personalized Image Completion
AlbumFill is a training-free framework that retrieves identity-consistent references from personal albums for personalized image completion.
2605.02892
Computer VisionRes2Net: A New Multi-scale Backbone Architecture
Res2Net introduces a novel CNN building block that enhances multi-scale feature representation within a single residual block, improving performance across various vision tasks.
1904.01169
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.