Zeyu Zheng
3 papers · Latest:
Natural Language Processing
VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation
VLAA-GUI is a modular framework for GUI automation, preventing early stopping and repetitive loops using a verifier, loop breaker, and search agent.
2604.21375
Software Engineering
Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering
LLM agents increasingly externalize capabilities like memory, skills, and protocols into surrounding infrastructure, transforming how they solve complex tasks.
2604.08224
Natural Language Processing
Gemini: A Family of Highly Capable Multimodal Models
Gemini is a family of multimodal AI models that excel at image, audio, video, and text understanding, achieving state-of-the-art results across numerous benchmarks, including human-expert-level performance on MMLU.
2312.11805