Tianyi Zhou

4 papers · Latest: April 24, 2026

Superminds Test: Actively Evaluating Collective Intelligence of Agent Society via Probing Agents

Superminds Test reveals that collective intelligence does not spontaneously emerge in large-scale LLM agent societies due to sparse, shallow interactions.

2604.22452Apr 24, 2026

Natural Language Processing

Convergent Evolution: How Different Language Models Learn Similar Number Representations

Different language models exhibit convergent evolution, learning similar periodic number representations, though geometric separability requires specific training conditions.

2604.20817Apr 22, 2026

Artificial Intelligence

ClawEnvKit: Automatic Environment Generation for Claw-Like Agents

ClawEnvKit automates diverse environment generation for claw-like agents from natural language, enabling scalable evaluation and adaptive training.

2604.18543Apr 20, 2026

Artificial Intelligence

Benchmarks for Trajectory Safety Evaluation and Diagnosis in OpenClaw and Codex: ATBench-Claw and ATBench-CodeX

Introduces ATBench-Claw and ATBench-CodeX, new benchmarks for evaluating and diagnosing safety in agent trajectories for OpenClaw and OpenAI Codex.

2604.14858Apr 16, 2026

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.