What is: GShard?
Source | GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding |
Year | 2020 |
Data Source | CC BY-SA - https://paperswithcode.com |
GShard is an intra-layer parallel distributed method. It consists of a set of simple annotation APIs and a compiler extension in XLA that automatically parallelizes the annotated computation.
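
To make "intra-layer parallel" concrete, the following is a minimal sketch in plain Python, not GShard's actual API. It assumes a single dense layer whose weight matrix is split by columns across shards. The names `split_columns` and `sharded_matmul` are hypothetical: the first stands in for a "split this tensor along dimension 1" annotation, and the second for the partitioned program a compiler might generate from it. Everything runs in one process here, so the shards only emulate devices.

```python
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain dense matrix multiply: (m x k) @ (k x n) -> (m x n)."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def split_columns(w: Matrix, num_shards: int) -> List[Matrix]:
    """Hypothetical stand-in for a 'split along dim 1' annotation."""
    n = len(w[0])
    if n % num_shards != 0:
        raise ValueError("column count must be divisible by num_shards")
    size = n // num_shards
    return [[row[i * size:(i + 1) * size] for row in w] for i in range(num_shards)]


def sharded_matmul(x: Matrix, w: Matrix, num_shards: int) -> Matrix:
    """Compute x @ w with the weight matrix partitioned inside the layer."""
    shards = split_columns(w, num_shards)
    # Each partial product would run on its own device in a real system.
    partials = [matmul(x, shard) for shard in shards]
    # Concatenate the per-shard outputs back along the column dimension.
    return [sum((p[r] for p in partials), []) for r in range(len(x))]


x = [[1.0, 2.0], [3.0, 4.0]]
w = [[1.0, 0.0, 2.0, 1.0], [0.0, 1.0, 1.0, 3.0]]
result = sharded_matmul(x, w, num_shards=2)
```

The sharded result matches the unsharded `matmul(x, w)`. In this sketch the user only states how the weight is split, and the partitioning and recombination are derived from that, which mirrors the division of labor between annotations and compiler described above.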