sherpa-onnx
sherpa-onnx copied to clipboard
[Help wanted] Support quantization
trafficstars
TODO
- [ ] Support fp16 and/or int8 with TensorRT
Hi @csukuangfj ,
Currently I am doing some experiments with Zipformer models, let me know if there are active developments going on from your end for TensorRT low precision support, else I will take up on this task :)
else I will take up on this task :)
Thanks! That would be great. We don't have a plan to support tensorrt in the near future.