Federico Tombari

2 papers · Latest: April 22, 2026

SSL-R1: Self-Supervised Visual Reinforcement Post-Training for Multimodal Large Language Models

SSL-R1 introduces a self-supervised visual reinforcement learning framework for MLLMs, deriving verifiable rewards directly from images.

R-CoV is a post-hoc, region-aware chain-of-verification method that significantly alleviates object hallucinations in large vision-language models.
