FunASR
FunASR copied to clipboard
热词间有相互干扰
Notice: In order to resolve issues more efficiently, please raise issue following the template. (注意:为了更加高效率解决您遇到的问题,请按照模板提问,补充细节)
🐛 Bug
在runtime环境下使用speech_paraformer-large-contextual_asr_nat-zh-cn-16k-common-vocab8404-onnx模型。添加如下热词表时感觉热词直接会有相互干扰。比如 针灸铜人 80 久通 80
测试可能会出现针灸通人、久铜等结果。请问添加热词是单独提高token概率吗。如果是全词匹配的话,按说wfst里影响不会这么大。有无办法解决?
Environment
- OS (e.g., Linux): Linux
- FunASR Version (e.g., 1.0.0): FunASR
- ModelScope Version (e.g., 1.11.0): runtime
- PyTorch Version (e.g., 2.0.0):
- How you installed funasr (
pip, source): - Python version:
- Docker version: funasr-runtime-sdk-online-cpu-0.1.9
- Any other relevant information: