SenseVoice
SenseVoice copied to clipboard

Published 20 hours ago •

Reame
Issues

导出的onnx 模型比正常的模型推理慢

Open DuBaiSheng opened this issue 5 months ago • 2 comments

使用export 导出的onnx格式的模型，并使用SenseVoiceSmall加载，批次推理的时长，比原本使用AutoModel加载的原始模型要慢7倍。是什么原因呢，都是使用GPU加载推理。

Sep 25 '24 06:09 DuBaiSheng