Alexander Kolesnikov

2 papers · Latest: June 8, 2021

Scaling Vision Transformers

This paper studies how Vision Transformers scale with model size and data, improving their architecture and training to achieve state-of-the-art ImageNet accuracy with a 2-billion parameter model.

2106.04560Jun 8, 2021

Computer Vision

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

This paper demonstrates that a pure Transformer model applied directly to image patches can achieve state-of-the-art image classification performance without relying on convolutional networks.

2010.11929Oct 22, 2020

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.