What is Tofu?
Source | Supporting Very Large Models using Automatic Dataflow Graph Partitioning |
Year | 2018 |
Data Source | CC BY-SA - https://paperswithcode.com |
Tofu is an intra-layer model-parallel system that partitions very large DNN models across multiple GPUs to reduce the per-GPU memory footprint. It is designed to partition a dataflow graph of fine-grained tensor operators, the representation used by platforms like MXNet and TensorFlow. To choose a partitioning for each operator in the dataflow graph, Tofu uses a recursive search algorithm that minimizes the total communication cost.
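The idea of searching for a partitioning that minimizes total communication cost can be illustrated with a toy example. This is a minimal sketch under stated assumptions, not Tofu's actual algorithm: it assumes a simple chain of three operators, two possible partition axes per tensor ("row" or "col"), and a made-up cost table. The names `AXES`, `OPS`, `best_plan`, and `partition_chain` are hypothetical and do not come from Tofu, MXNet, or TensorFlow.

```python
from functools import lru_cache

# Hypothetical partition choices for every tensor in the chain.
AXES = ("row", "col")

# Hypothetical communication costs (arbitrary units) for each operator,
# keyed by (input partition axis, output partition axis).
# These numbers are invented for illustration only.
OPS = [
    {("row", "row"): 0, ("row", "col"): 4, ("col", "row"): 4, ("col", "col"): 2},
    {("row", "row"): 3, ("row", "col"): 1, ("col", "row"): 0, ("col", "col"): 5},
    {("row", "row"): 2, ("row", "col"): 2, ("col", "row"): 6, ("col", "col"): 0},
]


@lru_cache(maxsize=None)
def best_plan(op_index, in_axis):
    """Return (cost, output axes) for operators op_index.. given the input axis."""
    if op_index == len(OPS):
        return 0, ()  # no operators left, nothing to communicate
    candidates = []
    for out_axis in AXES:
        step_cost = OPS[op_index][(in_axis, out_axis)]
        rest_cost, rest_plan = best_plan(op_index + 1, out_axis)
        candidates.append((step_cost + rest_cost, (out_axis,) + rest_plan))
    # Lowest total cost wins; ties are broken by the plan tuple ordering.
    return min(candidates)


def partition_chain():
    """Try each partition axis for the first tensor and keep the cheapest plan."""
    options = []
    for first_axis in AXES:
        cost, plan = best_plan(0, first_axis)
        options.append((cost, (first_axis,) + plan))
    return min(options)


if __name__ == "__main__":
    cost, plan = partition_chain()
    print(cost, plan)  # 1 ('row', 'row', 'col', 'col')
```

With this cost table the cheapest plan keeps the first two tensors row-partitioned and switches to column partitioning afterwards, for a total cost of 1. Memoizing on `(op_index, in_axis)` keeps the recursion linear in the chain length; a real dataflow graph is not a simple chain, so this only conveys the flavor of a cost-minimizing search.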