Morteza Heidari issues

Repositories
Issues
Comments

Results 2 issues of


                                            Morteza Heidari

How to Perform Static Quantization Directly on an ONNX Model Using Intel® Neural Compressor?

Hello, I'm using Intel® Neural Compressor (INC) to perform static quantization on my custom PyTorch model. I followed [this script](https://github.com/oneapi-src/oneAPI-samples/blob/master/AI-and-Analytics/Getting-Started-Samples/INC-Quantization-Sample-for-PyTorch/quantize_with_inc.ipynb) which demonstrates how to apply static quantization using INC on...

question

PEFT to ONNX conversion

### System Info ```shell Hello! I have a fine-tuned LLM model from Hugging Face saved in PEFT format, and it’s about 2.1 GB. When we convert it to ONNX, its...

bug