Tianyi Zhou
4 papers · Latest:
Superminds Test: Actively Evaluating Collective Intelligence of Agent Society via Probing Agents
Superminds Test reveals that collective intelligence does not emerge spontaneously in large-scale LLM agent societies, because interactions among agents are sparse and shallow.
Convergent Evolution: How Different Language Models Learn Similar Number Representations
Different language models exhibit convergent evolution, learning similar periodic number representations, though geometric separability requires specific training conditions.
ClawEnvKit: Automatic Environment Generation for Claw-Like Agents
ClawEnvKit automates diverse environment generation for claw-like agents from natural language, enabling scalable evaluation and adaptive training.
Benchmarks for Trajectory Safety Evaluation and Diagnosis in OpenClaw and Codex: ATBench-Claw and ATBench-CodeX
Introduces ATBench-Claw and ATBench-CodeX, new benchmarks for evaluating and diagnosing safety in agent trajectories for OpenClaw and OpenAI Codex.