What is: AdaSqrt?
Source | Second-order Information in First-order Optimization Methods |
Year | 2000 |
Data Source | CC BY-SA - https://paperswithcode.com |
AdaSqrt is a stochastic optimization technique that is motivated by the observation that methods like Adagrad and Adam can be viewed as relaxations of Natural Gradient Descent.
The updates are performed as follows: