What is: Reformer?
Source | Reformer: The Efficient Transformer |
Year | 2020 |
Data Source | CC BY-SA - https://paperswithcode.com |
Reformer is a Transformer-based architecture designed to improve efficiency. It replaces dot-product attention with attention based on locality-sensitive hashing (LSH), reducing its complexity from $O(L^2)$ to $O(L \log L)$, where $L$ is the length of the sequence. Reformer also uses reversible residual layers instead of standard residuals, so activations need to be stored only once during training instead of $N$ times, where $N$ is the number of layers.
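The two ideas can be illustrated with a minimal sketch in plain Python. It is not the reference implementation: the helper names (`make_rotations`, `lsh_bucket`, `rev_forward`, `rev_inverse`) are hypothetical, the hashing uses the angular scheme of random projections followed by an argmax, and the reversible block works on scalars with stand-in functions `f` and `g` in place of the attention and feed-forward sublayers.

```python
import math
import random


def make_rotations(dim, n_buckets, seed=0):
    """Hypothetical helper: draw n_buckets / 2 random Gaussian directions."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)]
            for _ in range(n_buckets // 2)]


def lsh_bucket(vec, rotations):
    """Angular LSH: project vec onto the random directions and return the
    index of the largest entry of [proj, -proj]. Vectors pointing in similar
    directions tend to land in the same bucket, so attention can be limited
    to items that share a bucket instead of all L positions."""
    proj = [sum(v * r for v, r in zip(vec, row)) for row in rotations]
    scores = proj + [-p for p in proj]
    return max(range(len(scores)), key=scores.__getitem__)


def rev_forward(x1, x2, f, g):
    """Reversible residual block: y1 = x1 + f(x2), y2 = x2 + g(y1)."""
    y1 = x1 + f(x2)
    y2 = x2 + g(y1)
    return y1, y2


def rev_inverse(y1, y2, f, g):
    """Recover the block's inputs from its outputs, so intermediate
    activations can be recomputed in the backward pass, not stored."""
    x2 = y2 - g(y1)
    x1 = y1 - f(x2)
    return x1, x2


if __name__ == "__main__":
    rotations = make_rotations(dim=4, n_buckets=8)
    query = [0.3, -1.2, 0.7, 0.1]
    scaled = [2.0 * q for q in query]  # same direction, different norm
    print(lsh_bucket(query, rotations), lsh_bucket(scaled, rotations))

    f, g = math.tanh, math.sin  # stand-ins for attention and feed-forward
    y1, y2 = rev_forward(0.5, -1.5, f, g)
    print(rev_inverse(y1, y2, f, g))
```

Because `rev_inverse` reconstructs each layer's inputs from its outputs, only the final layer's activations have to be kept, which is where the saving over storing $N$ sets of activations comes from.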