fxmarty

332 comments by fxmarty

@bloodsucker99 You need to pass the environment variable `GPTQ_BITS` (though I think `gptq_bits` and `gptq_groupsize` could be inferred directly from the shapes of `qweights`, `qzeros`, and `g_idx`?)
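A rough sketch of what that shape-based inference could look like. It assumes an AutoGPTQ-style layout where quantized values are packed into 32-bit integers, `qweight` has shape `(in_features * bits / 32, out_features)`, `qzeros` has one row per group, and `g_idx` has one entry per input feature. The helper name `infer_gptq_params` is hypothetical and is not an existing API in any library.

```python
def infer_gptq_params(qweight_shape, qzeros_shape, g_idx_shape, pack_bits=32):
    """Guess (bits, groupsize) from GPTQ tensor shapes.

    Assumed layout (AutoGPTQ-style packing, not verified against a checkpoint):
      qweight: (in_features * bits // pack_bits, out_features)
      qzeros:  (n_groups, out_features * bits // pack_bits)
      g_idx:   (in_features,)
    """
    in_features = g_idx_shape[0]
    packed_rows = qweight_shape[0]
    n_groups = qzeros_shape[0]

    # Each packed row holds pack_bits // bits values, so
    # packed_rows * pack_bits == in_features * bits.
    bits, remainder = divmod(packed_rows * pack_bits, in_features)
    if remainder != 0 or bits not in (2, 3, 4, 8):
        raise ValueError(f"shapes do not match a known packing: bits={bits}")

    # Assumes in_features is an exact multiple of the group size.
    # A single group means one set of scales/zeros for all input features.
    if in_features % n_groups != 0:
        raise ValueError("in_features is not a multiple of the number of groups")
    groupsize = in_features // n_groups

    return bits, groupsize


# Example: a 4096x4096 linear layer quantized to 4 bits with group size 128.
bits, groupsize = infer_gptq_params((512, 4096), (32, 512), (4096,))
print(bits, groupsize)
```

If the checkpoint uses a different packing scheme, the arithmetic above would not hold, so passing the value explicitly via `GPTQ_BITS` remains the safe option.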

Hi, let me have a look next week.

Hi @mszsorondo , indeed the page https://huggingface.co/docs/transformers/serialization#export-to-onnx is a bit outdated. I'll do a PR to fix it. In your EDIT II, were you referring to this page? I'd recommend...

Hi @bhavnicksm , @mht-sharma just merged the Pegasus ONNX config yesterday! https://github.com/huggingface/optimum/pull/620

@bhavnicksm Can you open an issue in Optimum with your environment details? We can track it there!

Feel free! Don't hesitate to ask any questions if needed.

@rcshubhadeep I moved your issue to https://github.com/huggingface/optimum/issues/968

Hi @rishabbala , sounds good, let us know if you need any help! A good reference is https://huggingface.co/docs/optimum/main/en/exporters/onnx/usage_guides/contribute