What is: Neural Network Compression Framework?
Source | Neural Network Compression Framework for fast model inference |
Year | 2020 |
Data Source | CC BY-SA - https://paperswithcode.com |
Neural Network Compression Framework, or NNCF, is a Python-based framework for neural network compression with fine-tuning. It builds on recent advances in network compression and implements several of these methods, namely quantization, sparsity, filter pruning, and binarization. These methods produce more hardware-friendly models that run efficiently on general-purpose hardware (CPU, GPU) or on specialized deep learning accelerators.
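To make two of the methods named above concrete, here is a minimal sketch in plain Python of symmetric uniform quantization and magnitude-based sparsity applied to a flat list of weights. It does not use NNCF's API: the function names `fake_quantize` and `magnitude_sparsify`, their parameters, and the sample weights are hypothetical and chosen only for illustration. A real framework would apply these transformations per layer and fine-tune the model afterwards to recover accuracy.

```python
def fake_quantize(weights, num_bits=8):
    """Symmetric uniform quantization (illustrative, not NNCF's implementation).

    Weights are rounded to a signed integer grid and mapped back to floats,
    so the result shows the rounding error that quantization introduces.
    """
    qmax = 2 ** (num_bits - 1) - 1          # 127 for 8 bits
    max_abs = max(abs(w) for w in weights)
    if max_abs == 0:
        return list(weights)                # nothing to scale
    scale = max_abs / qmax                  # float value of one integer step
    quantized = []
    for w in weights:
        q = round(w / scale)                # nearest integer level
        q = max(-qmax, min(qmax, q))        # clamp to the representable range
        quantized.append(q * scale)         # back to float ("fake" quantization)
    return quantized


def magnitude_sparsify(weights, sparsity=0.5):
    """Zero out the given fraction of weights with the smallest magnitude."""
    n_zero = int(len(weights) * sparsity)
    # Indices ordered from smallest to largest absolute value.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = list(weights)
    for i in order[:n_zero]:
        pruned[i] = 0.0
    return pruned


# Hypothetical weights for a single layer.
weights = [0.50, -1.27, 0.03, 0.90, -0.02, 0.40]

quantized = fake_quantize(weights, num_bits=8)
sparse = magnitude_sparsify(weights, sparsity=0.5)

print(quantized)
print(sparse)
```

With 8 bits, each quantized weight differs from the original by at most half a quantization step, while the sparsified list keeps only the three largest-magnitude weights. Both outputs are cheaper to store and compute with, which is the sense in which such models are more hardware-friendly.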