Federico Tombari
2 papers · Latest:
Computer Vision
SSL-R1: Self-Supervised Visual Reinforcement Post-Training for Multimodal Large Language Models
SSL-R1 introduces a self-supervised visual reinforcement learning framework for MLLMs, deriving verifiable rewards directly from images.
2604.20705
Computer Vision
R-CoV: Region-Aware Chain-of-Verification for Alleviating Object Hallucinations in LVLMs
R-CoV is a post-hoc, region-aware chain-of-verification method that significantly alleviates object hallucinations in large vision-language models.
2604.20696