Thomas Scialom

5 papers · Latest: July 31, 2024

The Llama 3 Herd of Models

Llama 3 is a new family of large multilingual foundation models excelling in language, coding, reasoning, and multimodal tasks, rivaling GPT-4 in quality and offering extensive public releases.

2407.21783Jul 31, 2024

Natural Language Processing

GAIA: a benchmark for General AI Assistants

GAIA is a new benchmark designed to evaluate AI assistants on real-world tasks requiring reasoning, multi-modality, web browsing, and tool use, highlighting a significant gap between AI and human performance.

2311.12983Nov 21, 2023

Natural Language Processing

Code Llama: Open Foundation Models for Code

Code Llama is a new family of open-source large language models specialized for coding tasks, achieving state-of-the-art results on multiple benchmarks with support for long contexts and code infilling.

2308.12950Aug 24, 2023

Natural Language Processing

Llama 2: Open Foundation and Fine-Tuned Chat Models

Llama 2 introduces a range of open-source large language models, including fine-tuned chat models that outperform existing open-source alternatives in benchmarks and human evaluations.

2307.09288Jul 18, 2023

Natural Language Processing

Toolformer: Language Models Can Teach Themselves to Use Tools

Toolformer enables language models to autonomously learn to use external tools via APIs, significantly enhancing their performance on diverse tasks without extra supervision.

2302.04761Feb 9, 2023

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.