Naihao Deng
2 papers ยท Latest:
Human-Computer Interaction
Beyond Screenshots: Evaluating VLMs' Understanding of UI Animations
This paper evaluates VLMs' understanding of UI animations using a new dataset, finding they detect motion but struggle with high-level interpretation.
2604.26148
Artificial IntelligenceSafetyALFRED: Evaluating Safety-Conscious Planning of Multimodal Large Language Models
SafetyALFRED evaluates multimodal LLMs' embodied safety planning, revealing a gap between hazard recognition and active mitigation in real-world scenarios.
2604.19638
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.