A Hub for Transformer Blogs and Papers
This is a growing list of pointers to useful blog posts and papers related to transformers.
Transformers explained
- Blog: The Illustrated Transformer has many intuitive animations of how transformer models work
- Blog: Universal Transformers introduces the idea of recurrence across layers, with the same layer applied repeatedly over depth
- Blog: Transformer vs RNN and CNN for Translation Task
GNNs: similarities and differences
- Blog: Transformers are Graph Neural Networks bridges transformer models and Graph Neural Networks
Transformer improvements
- Blog: DeepMind Releases a New Architecture and a New Dataset to Improve Long-Term Memory in Deep Learning Systems (Neural Turing Machine + transformer?)