What is: Crossbow?
Source | CROSSBOW: Scaling Deep Learning with Small Batch Sizes on Multi-GPU Servers |
Year | 2000 |
Data Source | CC BY-SA - https://paperswithcode.com |
Crossbow is a single-server multi-GPU system for training deep learning models that enables users to freely choose their preferred batch size—however small—while scaling to multiple GPUs. Crossbow uses many parallel model replicas and avoids reduced statistical efficiency through a new synchronous training method. SMA, a synchronous variant of model averaging, is used in which replicas independently explore the solution space with gradient descent, but adjust their search synchronously based on the trajectory of a globally-consistent average model.