Hao Fei
2 papers ยท Latest:
Computer Vision
Audio-Visual Intelligence in Large Foundation Models
This survey provides the first comprehensive review of Audio-Visual Intelligence (AVI) in large foundation models, unifying tasks, methods, and challenges.
2605.04045
Natural Language ProcessingTaming Actor-Observer Asymmetry in Agents via Dialectical Alignment
This paper introduces ReTAS, a new method to mitigate Actor-Observer Asymmetry in LLM agents by enforcing perspective-invariant reasoning.
2604.19548
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.