Shengyuan Liu

2 papers · Latest: April 30, 2026

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

Claw-Eval-Live is a live benchmark for LLM agents, evaluating their performance on evolving real-world workflows with verifiable execution.

2604.28139Apr 30, 2026

Computer Vision

NeuroClaw Technical Report

NeuroClaw is a multi-agent AI system designed to make neuroimaging research more executable and reproducible by handling diverse data and complex pipelines.

2604.24696Apr 27, 2026

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.