Ami Baid
2 papers ยท Latest:
Computer Vision
Personal Visual Context Learning in Large Multimodal Models
This paper defines Personal VCL for LMMs, presents a benchmark, and proposes the Agentic Context Bank to enable personalized visual reasoning.
2605.10936
Computer VisionDon't Let the Video Speak: Audio-Contrastive Preference Optimization for Audio-Visual Language Models
ACPO mitigates video-driven audio hallucination in AVLMs by using a dual-axis preference learning framework for faithful audio grounding.
2604.14129
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.