Qing Liu

3 papers · Latest: May 6, 2026

Computer Vision

FairEnc: A Fair Vision-Language Model with Fair Vision and Text Encoders for Glaucoma Detection

FairEnc is a VLM pretraining method that debiases both vision and text encoders for fair glaucoma detection across diverse patient populations.

2605.04882May 6, 2026

Natural Language Processing

CC-OCR V2: Benchmarking Large Multimodal Models for Literacy in Real-world Document Processing

CC-OCR V2 introduces a new benchmark for evaluating Large Multimodal Models on real-world document processing, revealing current models fall short.

2605.03903May 5, 2026

Computer Vision

AlbumFill: Album-Guided Reasoning and Retrieval for Personalized Image Completion

AlbumFill is a training-free framework that retrieves identity-consistent references from personal albums for personalized image completion.

2605.02892May 4, 2026

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.