Stella Biderman
2 papers · Latest:
Natural Language Processing
Crosslingual Generalization through Multitask Finetuning
This paper demonstrates that multitask finetuning of large multilingual language models on English and machine-translated prompts enables strong zero-shot crosslingual generalization to many languages, including those unseen during training.
2211.01786
Natural Language Processing
GPT-NeoX-20B: An Open-Source Autoregressive Language Model
GPT-NeoX-20B is a 20-billion-parameter open-source autoregressive language model that demonstrates strong few-shot reasoning abilities and outperforms comparable models in multi-shot settings.
2204.06745