What is: Stochastic Weight Averaging?
Source | Averaging Weights Leads to Wider Optima and Better Generalization |
Year | 2000 |
Data Source | CC BY-SA - https://paperswithcode.com |
Stochastic Weight Averaging is an optimization procedure that averages multiple points along the trajectory of SGD, with a cyclical or constant learning rate. On the one hand it averages weights, but it also has the property that, with a cyclical or constant learning rate, SGD proposals are approximately sampling from the loss surface of the network, leading to stochastic weights and helping to discover broader optima.