fairseq How to Qauntize Wav2vec model?

How to Qauntize Wav2vec model?

Open mehrdad78 opened this issue 2 years ago • 0 comments

hi there. I want to quantize my wav2vec model .u used dynamic_quantize but i t did not help and my model get slower then before. how should i that? i search docs and issues but did not find any useful content. i think probably , i should Dequantize _k , q , v ._ but how can do that?

search the issues.
search the docs.

model_int8 = torch.quantization.quantize_dynamic(
  model,
  {torch.nn.Linear},
   dtype=torch.qint8,
   inplace=True
)

transformer
PyTorch
Google Colab

Aug 10 '22 07:08 mehrdad78

fairseq fairseq copied to clipboard

How to Qauntize Wav2vec model?

fairseq
fairseq copied to clipboard