Jakob Uszkoreit
2 papers ยท Latest:
Computer Vision
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
This paper demonstrates that a pure Transformer model applied directly to image patches can achieve state-of-the-art image classification performance without relying on convolutional networks.
2010.11929
Natural Language ProcessingAttention Is All You Need
The paper introduces the Transformer, a novel neural network architecture based solely on attention mechanisms that outperforms traditional recurrent and convolutional models in sequence transduction tasks like machine translation.
1706.03762
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.