model-optimization
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
Remove APIs / logic that is overdue for removal.
**System information** - TensorFlow version (you are using): 2.7 - Are you willing to contribute it (Yes/No): Yes **Motivation** Neither pruning nor clustering currently supports TensorFlow Hub models....
Add support for QAT and PCQAT of a GELU layer, or of a Dense/Conv layer with a GELU activation function.
**Describe the Issue** Activity Regularizer not working with quantization aware training (QAT). TypeError: An op outside of the function building code is being passed a "Graph" tensor. **System information** TensorFlow...
**Describe the bug** **System information** TensorFlow version (installed from source or binary): 2.6.0 TensorFlow Model Optimization version (installed from source or binary): 0.7.0 Python version: 3.7.10 **Describe the expected behavior**...
The API documentation for `prune_low_magnitude` seems to be incomplete.
Adapted from the source published with "DRIVE: One-bit Distributed Mean Estimation" (NeurIPS '21; https://arxiv.org/pdf/2105.08339.pdf), and the algorithm presented therein. Without going into the details of the paper, the key reason...
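As a rough illustration of the one-bit idea behind the item above, here is a minimal pure-Python sketch, not the code from the paper or from this repository. It assumes a randomized Hadamard rotation (random sign flips followed by a normalized Walsh-Hadamard transform), an input length that is a power of two, and a scale chosen so that the reconstruction's inner product with the input equals the input's squared norm. The function names `fwht`, `encode`, and `decode` are hypothetical, and whether these choices match the adapted source is an assumption.

```python
import math
import random


def fwht(v):
    """Normalized fast Walsh-Hadamard transform.

    Assumes len(v) is a power of two. With the 1/sqrt(n) factor the
    transform is orthogonal and its own inverse.
    """
    v = list(v)
    n = len(v)
    h = 1
    while h < n:
        for i in range(0, n, 2 * h):
            for j in range(i, i + h):
                a, b = v[j], v[j + h]
                v[j], v[j + h] = a + b, a - b
        h *= 2
    norm = 1.0 / math.sqrt(n)
    return [x * norm for x in v]


def encode(x, signs):
    """Sender side: rotate x, keep one sign per coordinate plus one float scale."""
    rotated = fwht([s * xi for s, xi in zip(signs, x)])
    l1 = sum(abs(r) for r in rotated)
    if l1 == 0.0:
        # All-zero input: any signs work, and the scale is zero.
        return [1] * len(x), 0.0
    # Assumed scale: squared L2 norm of x over the L1 norm of the rotated vector.
    scale = sum(xi * xi for xi in x) / l1
    bits = [1 if r >= 0 else -1 for r in rotated]
    return bits, scale


def decode(bits, scale, signs):
    """Receiver side: undo the rotation applied to the scaled sign vector."""
    unrotated = fwht([scale * b for b in bits])
    return [s * u for s, u in zip(signs, unrotated)]


if __name__ == "__main__":
    rng = random.Random(0)
    d = 8
    x = [rng.uniform(-1.0, 1.0) for _ in range(d)]
    # Sender and receiver are assumed to share these random signs (e.g. via a seed).
    signs = [rng.choice((-1, 1)) for _ in range(d)]
    bits, scale = encode(x, signs)
    print(decode(bits, scale, signs))
```

In this sketch the sender transmits `d` sign bits and a single float, and the receiver needs only the shared random signs to reconstruct an estimate of `x`.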
Trying to prune a MobileNetV3 Large model after training fails with the error **ValueError: Please initialize `Prune` with a supported layer. Layers should either be supported by the PruneRegistry...
Hello, I have used your QAT model to quantize to different bit widths, but I saw that the resulting values were always FP values, even when they were quantized (e.g., if...