Li Yang
2 papers · Latest:
Computer Vision
Divide-then-Diagnose: Weaving Clinician-Inspired Contexts for Ultra-Long Capsule Endoscopy Videos
This paper introduces a new task, dataset (VideoCAP), and framework (DiCE) for diagnosis-driven summarization of ultra-long capsule endoscopy videos.
2604.21814
Computer Vision
NTIRE 2026 Challenge on Video Saliency Prediction: Methods and Results
NTIRE 2026 challenge overview: methods and results for video saliency prediction using a new 2,000-video dataset and crowdsourced fixations.
2604.14816