Topic: Transformers

Short answer

This page shows the most relevant public items for Transformers, ranked by trend activity and review signal. Use the weekly view for fast-moving changes, the monthly view for more stable patterns, and the all-time view for evergreen picks.


  1. Attention Is All You Need

    Paper · Jun 12, 2017 · arXiv · Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin

    The dominant sequence transduction models are based on complex recurrent or convolutional neural networks that include an encoder and a decoder. The best performing models also connect the encoder ...
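
    The core operation here is scaled dot-product attention, softmax(Q Kᵀ / √d_k) V. The single-head NumPy sketch below is a minimal illustration; the shapes, names, and toy inputs are assumptions for demonstration, not the paper's code.

    ```python
    import numpy as np

    def scaled_dot_product_attention(Q, K, V):
        """Single-head attention: softmax(Q @ K.T / sqrt(d_k)) @ V."""
        d_k = Q.shape[-1]
        scores = Q @ K.T / np.sqrt(d_k)                  # (n_q, n_k) similarities
        scores -= scores.max(axis=-1, keepdims=True)     # numerical stability
        weights = np.exp(scores)
        weights /= weights.sum(axis=-1, keepdims=True)   # row-wise softmax
        return weights @ V                               # weighted sum of values

    rng = np.random.default_rng(0)
    n, d = 5, 8                                          # toy length and head dim
    Q, K, V = (rng.normal(size=(n, d)) for _ in range(3))
    print(scaled_dot_product_attention(Q, K, V).shape)   # (5, 8)
    ```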

  2. Generating Long Sequences with Sparse Transformers

    Paper · Apr 23, 2019 · arXiv · Rewon Child, Scott Gray, Alec Radford, Ilya Sutskever

    Transformers are powerful sequence models, but their self-attention mechanism scales quadratically with sequence length, making them computationally prohibitive for long inputs like high-resolution...
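
    The quadratic bottleneck and the paper's fix can be seen in a toy attention mask. The sketch below contrasts dense causal attention, with O(n²) entries, against a strided sparse pattern in the spirit of the paper's factorized attention; the exact mask construction is an illustrative assumption, not the authors' implementation.

    ```python
    import numpy as np

    n, stride = 16, 4      # toy sequence length; stride chosen near sqrt(n)

    # Dense causal attention: position i attends to all j <= i -> O(n^2) work.
    dense = np.tril(np.ones((n, n), dtype=bool))

    i = np.arange(n)[:, None]
    j = np.arange(n)[None, :]
    # Factorized pattern (illustrative): a local window of width `stride`
    # plus every stride-th earlier "summary" position -> O(n * sqrt(n)) work.
    local = (i >= j) & (i - j < stride)
    strided = (i >= j) & (j % stride == stride - 1)
    sparse = local | strided

    print("dense entries: ", int(dense.sum()))   # 136 = n(n+1)/2
    print("sparse entries:", int(sparse.sum()))  # grows ~ n*sqrt(n), not n^2
    ```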

  3. Improving Language Understanding by Generative Pre-Training

    Paper · Jun 11, 2018 · OpenAI · Alec Radford, Karthik Narasimhan, Tim Salimans, Ilya Sutskever

    Natural language understanding comprises a wide range of diverse tasks such as textual entailment, question answering, semantic similarity assessment, and document classification. Although large un...
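
    The two-stage recipe the paper introduces is generative pre-training (maximize Σ_i log p(x_i | x_<i) with a Transformer decoder) followed by discriminative fine-tuning. The NumPy sketch below shows only the next-token objective on toy data; the random "logits" stand in for model outputs and are an assumption for illustration.

    ```python
    import numpy as np

    rng = np.random.default_rng(0)
    vocab, seq_len = 50, 10
    tokens = rng.integers(vocab, size=seq_len)        # toy token ids

    # Stand-in for Transformer-decoder outputs at each position.
    logits = rng.normal(size=(seq_len, vocab))
    log_probs = logits - np.log(np.exp(logits).sum(axis=-1, keepdims=True))

    # Language-modeling loss: position i predicts token i+1.
    nll = -log_probs[np.arange(seq_len - 1), tokens[1:]].mean()
    print(f"toy next-token NLL: {nll:.3f}")           # ~log(vocab) for random logits
    ```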

  4. RT-1: Robotics Transformer for Real-World Control at Scale

    Paper · Dec 13, 2022 · arXiv · Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Joseph Dabis, Chelsea Finn, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Tomas Jackson, et al. (Google DeepMind)

    By transferring knowledge from large, diverse, task-agnostic datasets, modern machine learning models can solve specific downstream tasks either zero-shot or with small task-specific datasets. We i...

  5. Perceiver: General Perception with Iterative Attention

    Paper · Mar 4, 2021 · arXiv · Andrew Jaegle, Felix Gimeno, Andrew Brock, Oriol Vinyals, Andrew Zisserman, Joao Carreira

    Biological systems perceive the world by simultaneously processing high-dimensional inputs from modalities as diverse as vision, audition, and touch. We introduce the Perceiver, an architecture tha...
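
    The Perceiver's key idea is to have a small learned latent array cross-attend to the full input array, so compute scales linearly in input size rather than quadratically. The sketch below shows one such cross-attention step; learned projections, iteration, and the latent self-attention tower are omitted, and all shapes are illustrative assumptions.

    ```python
    import numpy as np

    def cross_attention(latents, inputs):
        """Latents (m, d) attend over inputs (n, d); cost O(m*n), linear in n."""
        d = latents.shape[-1]
        scores = latents @ inputs.T / np.sqrt(d)     # (m, n) with m << n
        scores -= scores.max(axis=-1, keepdims=True)
        w = np.exp(scores)
        w /= w.sum(axis=-1, keepdims=True)
        return w @ inputs                            # (m, d): updated latents

    rng = np.random.default_rng(0)
    m, n, d = 8, 10_000, 16        # few latents vs. a large input byte array
    latents = rng.normal(size=(m, d))
    inputs = rng.normal(size=(n, d))
    print(cross_attention(latents, inputs).shape)    # (8, 16): set by latents
    ```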

  6. Generative Pretraining from Pixels

    Paper · Jun 17, 2020 · OpenAI · Mark Chen, Alec Radford, Rewon Child, Jeffrey Wu, Heewoo Jun, David Luan, Ilya Sutskever

    Inspired by the success of unsupervised representation learning in natural language processing with models like GPT-2, we examine whether similar models can learn useful representations for images....
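
    The setup the paper explores is to treat an image as a 1-D sequence of discrete pixel values and train the same next-token objective used for text. The sketch below shows only the data side of that idea on a toy grayscale image; the raster-scan ordering here is an illustrative assumption, and the paper additionally quantizes colors to a small palette.

    ```python
    import numpy as np

    rng = np.random.default_rng(0)
    image = rng.integers(256, size=(8, 8))          # toy 8x8 grayscale "image"

    sequence = image.reshape(-1)                    # raster-scan flatten: 64 tokens
    inputs, targets = sequence[:-1], sequence[1:]   # position i predicts pixel i+1
    print(len(inputs), "prediction steps, vocab size 256")
    ```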

Related Topics

company:openai-research (3) · cs.LG (2) · NLP (2) · lab:deep-mind-ai (2) · cs.CV (2)