
Hotwords interfere with each other

Open kli017 opened this issue 1 year ago • 0 comments

Notice: To help us resolve issues more efficiently, please follow the issue template and fill in the details.

🐛 Bug

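I am using the speech_paraformer-large-contextual_asr_nat-zh-cn-16k-common-vocab8404-onnx model in the runtime environment. When I add the hotword list below, the hotwords seem to interfere with each other. For example, with these two entries (hotword followed by its weight): 针灸铜人 80 and 久通 80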

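In testing, the output can come back as 针灸通人 or 久铜, which mix characters from the two hotwords. Does adding a hotword just raise the probability of its individual tokens? If the hotwords were matched as whole words, I would not expect them to have this much influence on each other in the WFST. Is there any way to fix this?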
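To make the question concrete, here is a toy sketch of the two behaviours being contrasted. It is not FunASR's biasing code: the lattice, the log-scores, the bonus value, and every function name are invented for illustration. It assumes the acoustically similar pairs 灸/久 and 铜/通 compete at the same positions, and compares a per-token boost (every character that occurs in any hotword gets a bonus) with a whole-phrase boost (a bonus only when the full hotword appears in the hypothesis).

```python
from itertools import product

# Hypothetical per-position candidates with made-up log-scores.
# 灸/久 and 铜/通 sound alike, so both are plausible at their positions.
LATTICE = [
    {"针": 0.0},
    {"灸": -0.5, "久": -1.0},
    {"铜": -1.0, "通": -0.8},
    {"人": 0.0},
]

def hypotheses(lattice):
    """Enumerate every path through the toy lattice as (text, score)."""
    for path in product(*(slot.items() for slot in lattice)):
        text = "".join(ch for ch, _ in path)
        score = sum(s for _, s in path)
        yield text, score

def token_level_bonus(text, hotwords, bonus):
    """Boost each character that occurs in ANY hotword, ignoring context."""
    boosted = set("".join(hotwords))
    return bonus * sum(ch in boosted for ch in text)

def phrase_level_bonus(text, hotwords, bonus):
    """Boost only when a complete hotword appears in the hypothesis."""
    return sum(bonus * len(w) for w in hotwords if w in text)

def best(lattice, hotwords, bias_fn, bonus=0.5):
    """Return the hypothesis with the highest biased score."""
    return max(
        hypotheses(lattice),
        key=lambda h: h[1] + bias_fn(h[0], hotwords, bonus),
    )[0]

if __name__ == "__main__":
    # One hotword: the per-token boost is enough to recover it.
    print(best(LATTICE, ["针灸铜人"], token_level_bonus))          # 针灸铜人
    # Adding 久通 also boosts 通, cancelling the advantage of 铜.
    print(best(LATTICE, ["针灸铜人", "久通"], token_level_bonus))  # 针灸通人
    # Whole-phrase matching is not disturbed by the second hotword.
    print(best(LATTICE, ["针灸铜人", "久通"], phrase_level_bonus)) # 针灸铜人
```

In this toy setting, the mixed result 针灸通人 appears only under the per-token boost, which is the kind of cross-hotword interference described above. Whether the runtime actually behaves this way is exactly what this issue is asking.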

Environment

  • OS (e.g., Linux): Linux
  • FunASR Version (e.g., 1.0.0): FunASR
  • ModelScope Version (e.g., 1.11.0): runtime
  • PyTorch Version (e.g., 2.0.0):
  • How you installed funasr (pip, source):
  • Python version:
  • Docker version: funasr-runtime-sdk-online-cpu-0.1.9
  • Any other relevant information:

kli017, May 14 '24 02:05