What is: Sparse Layer-wise Adaptive Moments optimizer for large Batch training?
Source | SLAMB: Accelerated Large Batch Training with Sparse Communication |
Year | 2000 |
Data Source | CC BY-SA - https://paperswithcode.com |
Please enter a description about the method here