Yuan Tian
4 papers · Latest:
Options, Not Clicks: Lattice Refinement for Consent-Driven MCP Authorization
Conleash is a client-side middleware that uses a risk lattice and policy engine to provide consent-driven, boundary-scoped authorization for MCP tool invocations.
Semia: Auditing Agent Skills via Constraint-Guided Representation Synthesis
Semia audits LLM agent skills by converting them into a Datalog fact base using CGRS, finding critical risks in over half of real-world skills.
When LLMs Lag Behind: Knowledge Conflicts from Evolving APIs in Code Generation
LLMs struggle with evolving APIs, even with RAG, because of context-memory conflicts; reliable code generation requires new benchmarks and techniques.
Gemini: A Family of Highly Capable Multimodal Models
Gemini is a new family of multimodal AI models excelling in image, audio, video, and text understanding, achieving state-of-the-art results across numerous benchmarks including human-expert level on MMLU.