Jacob Hilton

3 papers · Latest: May 6, 2026

Estimating the expected output of wide random MLPs more efficiently than sampling

This paper introduces a novel method to estimate the expected output of wide random MLPs without sampling, using cumulants and Hermite expansions.

2605.05179 · May 6, 2026

Natural Language Processing

Training language models to follow instructions with human feedback

This paper presents InstructGPT, a set of models obtained by fine-tuning GPT-3 with human feedback to align them with user intent, resulting in outputs that are more truthful, more helpful, and less toxic.

2203.02155 · Mar 4, 2022

Natural Language Processing

WebGPT: Browser-assisted question-answering with human feedback

WebGPT fine-tunes GPT-3 to answer complex questions by browsing the web and using human feedback to improve factual accuracy and answer quality.

2112.09332 · Dec 17, 2021
