What is: Universal Transformer?
Source | Universal Transformers |
Year | 2000 |
Data Source | CC BY-SA - https://paperswithcode.com |
The Universal Transformer is a generalization of the Transformer architecture. Universal Transformers combine the parallelizability and global receptive field of feed-forward sequence models like the Transformer with the recurrent inductive bias of RNNs. They also utilise a dynamic per-position halting mechanism.