Wei-Lin Chiang
2 papers ยท Latest:
Artificial Intelligence
ClawEnvKit: Automatic Environment Generation for Claw-Like Agents
ClawEnvKit automates diverse environment generation for claw-like agents from natural language, enabling scalable evaluation and adaptive training.
2604.18543
Natural Language ProcessingJudging LLM-as-a-Judge with MT-Bench and Chatbot Arena
This paper demonstrates that strong large language models like GPT-4 can effectively serve as judges to evaluate other LLM-based chat assistants, closely matching human preferences on open-ended tasks.
2306.05685
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.