distiller
How to convert from a quantization-aware training model to a post-training quantization model?
I want to ask the same question.
It seems that converting a quantization-aware training (QAT) model to a post-training quantization model is not yet covered in the documentation: https://nervanasystems.github.io/distiller/algo_quantization.html
Is there any plan to do this?
I (and, judging by the issue tracker, many others as well) would also be interested :)
Hi,
Sorry for the really late response...
The way QAT is implemented, the model is re-quantized on every minibatch. So by the end of training the model is already quantized and ready to use, i.e. the weights are already quantized and so are the activations. There is no need to post-training-quantize the model.
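To illustrate the point above, here is a minimal sketch (not Distiller's actual API; `fake_quantize` and the toy training loop are hypothetical) of how QAT re-quantizes weights on every minibatch, so the final weights already sit on the quantization grid:

```python
def fake_quantize(weights, num_bits=8):
    """Hypothetical symmetric linear quantization: quantize, then dequantize."""
    qmax = 2 ** (num_bits - 1) - 1              # e.g. 127 for 8 bits
    max_abs = max(abs(w) for w in weights) or 1.0
    scale = max_abs / qmax
    # Snap each weight to the nearest representable level.
    return [round(w / scale) * scale for w in weights]

# Toy "training" loop: after every minibatch update the weights are
# re-quantized, mimicking what QAT does during training.
weights = [0.731, -0.248, 0.502, -0.914]
for _ in range(3):                              # pretend minibatch updates
    weights = [w + 0.001 for w in weights]      # dummy gradient step
    weights = fake_quantize(weights)            # QAT re-quantization

# After the last step, every weight is an integer multiple of the scale,
# i.e. the model is already quantized; no post-training pass is needed.
scale = max(abs(w) for w in weights) / 127
print(all(abs(round(w / scale) * scale - w) < 1e-12 for w in weights))  # prints True
```

Because the last operation applied to the weights is the quantization itself, saving the model after training yields quantized weights directly.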
@zhengge @dongzhen123, could you please elaborate on what you mean by "convert the model", and what you aim to accomplish by doing so?
Thanks.