Alexander Kolesnikov
2 papers ยท Latest:
Computer Vision
Scaling Vision Transformers
This paper studies how Vision Transformers scale with model size and data, improving their architecture and training to achieve state-of-the-art ImageNet accuracy with a 2-billion parameter model.
2106.04560
Computer VisionAn Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
This paper demonstrates that a pure Transformer model applied directly to image patches can achieve state-of-the-art image classification performance without relying on convolutional networks.
2010.11929
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.