Morteza Heidari
Results
2
issues of
Morteza Heidari
Hello, I'm using Intel® Neural Compressor (INC) to perform static quantization on my custom PyTorch model. I followed [this script](https://github.com/oneapi-src/oneAPI-samples/blob/master/AI-and-Analytics/Getting-Started-Samples/INC-Quantization-Sample-for-PyTorch/quantize_with_inc.ipynb) which demonstrates how to apply static quantization using INC on...
question
### System Info ```shell Hello! I have a fine-tuned LLM model from Hugging Face saved in PEFT format, and it’s about 2.1 GB. When we convert it to ONNX, its...
bug