Ami Baid

2 papers · Latest: May 11, 2026

Personal Visual Context Learning in Large Multimodal Models

This paper defines Personal VCL for LMMs, presents a benchmark, and proposes the Agentic Context Bank to enable personalized visual reasoning.

ACPO mitigates video-driven audio hallucination in AVLMs by using a dual-axis preference learning framework for faithful audio grounding.

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.