Gargi Ghosh

2 papers · Latest: May 8, 2026

Fast Byte Latent Transformer

The Fast Byte Latent Transformer (BLT) introduces novel training and generation techniques to significantly speed up byte-level language models.

2605.08044 · May 8, 2026

Natural Language Processing

LIMA: Less Is More for Alignment

LIMA shows that fine-tuning a large language model on just 1,000 curated examples can achieve performance comparable to state-of-the-art models, highlighting the dominant role of pretraining over extensive instruction tuning.

2305.11206 · May 18, 2023
