Kaichen Zhang
2 papers · Latest:
Artificial Intelligence
Senses Wide Shut: A Representation-Action Gap in Omnimodal LLMs
Omnimodal LLMs struggle to reject false textual claims contradicting sensory input, revealing a "Representation-Action Gap" in grounding.
2605.13737
Computer Vision
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling
This paper proposes a new five-level taxonomy for visual generation, shifting from appearance synthesis to intelligent, agentic world modeling.
2604.28185