Yuan Tian

4 papers · Latest: May 12, 2026

Options, Not Clicks: Lattice Refinement for Consent-Driven MCP Authorization

Conleash is a client-side middleware that uses a risk lattice and policy engine to provide consent-driven, boundary-scoped authorization for MCP tool invocations.

2605.11360May 12, 2026

Cryptography & Security

Semia: Auditing Agent Skills via Constraint-Guided Representation Synthesis

Semia audits LLM agent skills by converting them into a Datalog fact base using CGRS, finding critical risks in over half of real-world skills.

2605.00314May 1, 2026

Software Engineering

When LLMs Lag Behind: Knowledge Conflicts from Evolving APIs in Code Generation

LLMs struggle with evolving APIs, even with RAG, due to context-memory conflicts, requiring new benchmarks and techniques for reliable code generation.

2604.09515Apr 10, 2026

Natural Language Processing

Gemini: A Family of Highly Capable Multimodal Models

Gemini is a new family of multimodal AI models excelling in image, audio, video, and text understanding, achieving state-of-the-art results across numerous benchmarks including human-expert level on MMLU.

2312.11805Dec 19, 2023

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.