Thomas Scialom
5 papers ยท Latest:
The Llama 3 Herd of Models
Llama 3 is a new family of large multilingual foundation models excelling in language, coding, reasoning, and multimodal tasks, rivaling GPT-4 in quality and offering extensive public releases.
GAIA: a benchmark for General AI Assistants
GAIA is a new benchmark designed to evaluate AI assistants on real-world tasks requiring reasoning, multi-modality, web browsing, and tool use, highlighting a significant gap between AI and human performance.
Code Llama: Open Foundation Models for Code
Code Llama is a new family of open-source large language models specialized for coding tasks, achieving state-of-the-art results on multiple benchmarks with support for long contexts and code infilling.
Llama 2: Open Foundation and Fine-Tuned Chat Models
Llama 2 introduces a range of open-source large language models, including fine-tuned chat models that outperform existing open-source alternatives in benchmarks and human evaluations.
Toolformer: Language Models Can Teach Themselves to Use Tools
Toolformer enables language models to autonomously learn to use external tools via APIs, significantly enhancing their performance on diverse tasks without extra supervision.
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.