Nicholas Andrews
2 papers ยท Latest:
Natural Language Processing
Inducing Artificial Uncertainty in Language Models
A new method induces artificial uncertainty in language models on easy data, improving their calibration and uncertainty quantification on challenging tasks.
2605.13595
Software EngineeringCan Coding Agents Reproduce Findings in Computational Materials Science?
AutoMat benchmarks LLM coding agents' ability to reproduce computational materials science findings, revealing current agents achieve low success rates.
2605.00803
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.