aimet icon indicating copy to clipboard operation
aimet copied to clipboard

Model Quantization Problems

Open lblbk opened this issue 3 years ago • 2 comments

I need to split a large model into three smaller models, how can these three models be quantitatively trained at the same time? thx.

lblbk avatar Jan 19 '22 03:01 lblbk

Hi @lblbk Could you please elaborate on the model architecture and provide more context, such as training framework used? And, AIMET offers algorithms for compression of model and model quantization. Please find more information here : https://github.com/quic/aimet#readme . Please let us know if you have further questions.

quic-ssiddego avatar Jan 22 '22 04:01 quic-ssiddego

Thx for you reply. My model consists of three parts, backbone, decoder and refiner.The original plan was to train the entire model together.But this is too slow, so the calculation divides them into three parts, similar to a cascaded network. My training framework is pytorch and Finetuning with aimet.

lblbk avatar Jan 26 '22 02:01 lblbk