Sharan Narang
5 papers · Latest:
The Llama 3 Herd of Models
Llama 3 is a new family of large multilingual foundation models excelling in language, coding, reasoning, and multimodal tasks, rivaling GPT-4 in quality and offering extensive public releases.
Llama 2: Open Foundation and Fine-Tuned Chat Models
Llama 2 introduces a range of open-source large language models, including fine-tuned chat models that outperform existing open-source alternatives in benchmarks and human evaluations.
PaLM: Scaling Language Modeling with Pathways
PaLM is a 540-billion parameter Transformer language model that achieves state-of-the-art few-shot learning performance across diverse benchmarks, demonstrating significant benefits from scaling.
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Self-consistency is a new decoding strategy that improves chain-of-thought reasoning in language models by sampling diverse reasoning paths and selecting the most consistent answer.
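The core idea can be sketched in a few lines: sample several reasoning paths from the model at nonzero temperature, keep only each path's final answer, and return the answer that appears most often. A minimal sketch, assuming a caller-supplied `sample_fn` that stands in for one sampled chain-of-thought from a language model (the sampler shown in the usage example is a hypothetical stub, not a real model call):

```python
from collections import Counter
from itertools import cycle

def majority_answer(answers):
    # The "most consistent" answer is simply the most frequent final answer.
    return Counter(answers).most_common(1)[0][0]

def self_consistency(sample_fn, question, n_samples=10):
    # Sample n diverse reasoning paths; marginalize over the paths by
    # majority vote on their final answers.
    answers = [sample_fn(question) for _ in range(n_samples)]
    return majority_answer(answers)

# Usage with a deterministic stub sampler: two paths conclude "18",
# one concludes "26", repeating.
stub_answers = cycle(["18", "18", "26"])
result = self_consistency(lambda q: next(stub_answers), "example question", n_samples=9)
print(result)  # "18" wins the majority vote
```

In practice the gain comes from diversity: greedy decoding commits to one reasoning path, while voting across sampled paths recovers the answer that many independent chains agree on.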
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
This paper introduces a unified text-to-text framework for transfer learning in NLP, achieving state-of-the-art results across diverse language tasks by systematically exploring pre-training and fine-tuning strategies.
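The unifying trick is that every task, from translation to classification, is cast as mapping an input string to an output string, with a short task prefix telling the model what to do. A minimal sketch of that input formatting, using prefixes in the style of the T5 paper (the exact prefix strings here are illustrative, not an exhaustive or authoritative list):

```python
def to_text2text(prefix, source):
    # T5-style framing: prepend a task prefix so one model handles all tasks
    # as plain text-in, text-out.
    return f"{prefix}: {source}"

# Different tasks, one uniform interface; the target is always a string too
# (e.g. the German translation, or the label word "positive").
translation_input = to_text2text("translate English to German", "That is good.")
sentiment_input = to_text2text("sst2 sentence", "the movie was wonderful")

print(translation_input)
print(sentiment_input)
```

Because inputs and targets are always text, the same model, loss, and decoding procedure apply unchanged across tasks, which is what makes the systematic comparison of pre-training and fine-tuning strategies in the paper possible.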