What is: Local SGD?
Source | Local SGD Converges Fast and Communicates Little |
Year | 2000 |
Data Source | CC BY-SA - https://paperswithcode.com |
Local SGD is a distributed training technique that runs SGD independently in parallel on different workers and averages the sequences only once in a while.