FunASR icon indicating copy to clipboard operation
FunASR copied to clipboard

Websocket 服务中针对不同音频识别结果差异巨大

Open WeiminLee opened this issue 1 year ago • 4 comments

Notice: In order to resolve issues more efficiently, please raise issue following the template. (注意:为了更加高效率解决您遇到的问题,请按照模板提问,补充细节)

❓ Questions and Help

Before asking:

  1. search the issues.
  2. search the docs.

What is your question?

Why do the recognition results vary significantly when using the code provided in the repository for different audio samples?

Code

code path: FunASR/websocket/funasr_client_api.py wav_ path1: FunASR-main\runtime\funasr_api\asr_example.wav" outputs: connect to url ws://127.0.0.1:10095 send json {"mode": "2pass", "chunk_size": [0, 10, 5], "encoder_chunk_look_back": 4, "decoder_chunk_look_back": 1, "chunk_interval": 10, "wav_name": "default", "is_speaking": true} text {'mode': '2pass-online', 'text': '欢迎大', 'wav_name': 'default', 'is_final': True} text {'mode': '2pass-online', 'text': '家来', 'wav_name': 'default', 'is_final': True} text {'mode': '2pass-online', 'text': '体验达', 'wav_name': 'default', 'is_final': True} text {'mode': '2pass-online', 'text': '摩院推', 'wav_name': 'default', 'is_final': True} text {'mode': '2pass-online', 'text': '出的语', 'wav_name': 'default', 'is_final': True} text {'mode': '2pass-online', 'text': '音识', 'wav_name': 'default', 'is_final': True} text {'mode': '2pass-online', 'text': '别模型', 'wav_name': 'default', 'is_final': True}

wav_ path1: FunASR-main\runtime\funasr_api\SSB00050002.wav"

connect to url ws://127.0.0.1:10095 send json {"mode": "2pass", "chunk_size": [0, 10, 5], "encoder_chunk_look_back": 4, "decoder_chunk_look_back": 1, "chunk_interval": 10, "wav_name": "default", "is_speaking": true} text {'mode': '2pass-online', 'text': '群', 'wav_name': 'default', 'is_final': True} text {'mode': '2pass-online', 'text': '你', 'wav_name': 'default', 'is_final': True} text {'mode': '2pass-offline', 'text': '春日,天寒。', 'wav_name': 'default', 'is_final': True} text {'mode': '2pass-online', 'text': '嗯', 'wav_name': 'default', 'is_final': True} text {'mode': '2pass-offline', 'text': '电话。', 'wav_name': 'default', 'is_final': True} text {'mode': '2pass-online', 'text': '嗯', 'wav_name': 'default', 'is_final': True} text {'mode': '2pass-online', 'text': '嗯', 'wav_name': 'default', 'is_final': True} text {'mode': '2pass-offline', 'text': '我的勇器。', 'wav_name': 'default', 'is_final': True}

What have you tried?

What's your environment?

  • OS (e.g., Linux):
  • FunASR Version (e.g., 1.0.0):
  • ModelScope Version (e.g., 1.11.0):
  • PyTorch Version (e.g., 2.0.0):
  • How you installed funasr (pip, source):
  • Python version:
  • GPU (e.g., V100M32)
  • CUDA/cuDNN version (e.g., cuda11.7):
  • Docker version (e.g., funasr-runtime-sdk-cpu-0.4.1)
  • Any other relevant information:

WeiminLee avatar Jul 15 '24 03:07 WeiminLee