Ping Nie
2 papers ยท Latest:
Natural Language Processing
FlexSQL: Flexible Exploration and Execution Make Better Text-to-SQL Agents
FlexSQL is a text-to-SQL agent that achieves better performance by flexibly exploring schemas, executing diverse plans, and employing a multi-level repair mechanism.
2605.02815
Natural Language ProcessingClawBench: Can AI Agents Complete Everyday Online Tasks?
ClawBench introduces a real-world benchmark of 153 online tasks across 144 live platforms, revealing current AI agents struggle with everyday web automation.
2604.08523
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.