What is: Highway Layer?
Source | Highway Networks |
Year | 2000 |
Data Source | CC BY-SA - https://paperswithcode.com |
A Highway Layer contains an information highway to other layers that helps with information flow. It is characterised by the use of a gating unit to help this information flow.
A plain feedforward neural network typically consists of layers where the th layer ({}) applies a nonlinear transform (parameterized by ) on its input to produce its output . Thus, is the input to the network and is the network’s output. Omitting the layer index and biases for clarity,
is usually an affine transform followed by a non-linear activation function, but in general it may take other forms.
For a highway network, we additionally define two nonlinear transforms and such that:
We refer to T as the transform gate and C as the carry gate, since they express how much of the output is produced by transforming the input and carrying it, respectively. In the original paper, the authors set , giving:
The authors set:
Image: Sik-Ho Tsang