Xue Yang
3 papers · Latest:
Artificial Intelligence
ComplexMCP: Evaluation of LLM Agents in Dynamic, Interdependent, and Large-Scale Tool Sandbox
ComplexMCP is a new benchmark evaluating LLM agents in dynamic, interdependent, and large-scale tool environments, revealing significant performance gaps.
2605.10787
Natural Language Processing
DPN-LE: Dual Personality Neuron Localization and Editing for Large Language Models
DPN-LE precisely edits personality in LLMs by targeting specific, mutually exclusive neurons, preserving general capabilities better than prior methods.
2604.27929
Computer Vision
MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation
MM-WebAgent is a hierarchical multimodal agent that generates coherent and visually consistent webpages by coordinating AIGC elements through planning and self-reflection.
2604.15309