What is: Online Normalization?
Source | Online Normalization for Training Neural Networks |
Year | 2000 |
Data Source | CC BY-SA - https://paperswithcode.com |
Online Normalization is a normalization technique for training deep neural networks. To define Online Normalization. we replace arithmetic averages over the full dataset in with exponentially decaying averages of online samples. The decay factors and for forward and backward passes respectively are hyperparameters for the technique.
We allow incoming samples , such as images, to have multiple scalar components and denote feature-wide mean and variance by and . The algorithm also applies to outputs of fully connected layers with only one scalar output per feature. In fact, this case simplifies to and . Denote scalars and to denote running estimates of mean and variance across all samples. The subscript denotes time steps corresponding to processing new incoming samples.
Online Normalization uses an ongoing process during the forward pass to estimate activation means and variances. It implements the standard online computation of mean and variance generalized to processing multi-value samples and exponential averaging of sample statistics. The resulting estimates directly lead to an affine normalization transform.