Yanjun Zhao
2 papers · Latest:
Computer Vision
Ramen: Robust Test-Time Adaptation of Vision-Language Models with Active Sample Selection
Ramen is a framework for robust test-time adaptation of vision-language models, handling mixed-domain shifts via active sample selection.
2604.21728
Information Retrieval
PAPERMIND: Benchmarking Agentic Reasoning and Critique over Scientific Papers in Multimodal LLMs
PAPERMIND is a new benchmark evaluating multimodal LLMs' integrated reasoning and critique over scientific papers across diverse domains.
2604.21304