FunASR
Recognition results differ greatly between audio files in the WebSocket service
Notice: To help resolve issues more efficiently, please raise issues following the template and fill in the details.
❓ Questions and Help
Before asking:
- search the issues.
- search the docs.
What is your question?
Why do the recognition results vary so much between audio samples when using the client code provided in the repository?
Code
code path: FunASR/websocket/funasr_client_api.py
wav_path1: FunASR-main\runtime\funasr_api\asr_example.wav
outputs:
connect to url ws://127.0.0.1:10095
send json {"mode": "2pass", "chunk_size": [0, 10, 5], "encoder_chunk_look_back": 4, "decoder_chunk_look_back": 1, "chunk_interval": 10, "wav_name": "default", "is_speaking": true}
text {'mode': '2pass-online', 'text': '欢迎大', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-online', 'text': '家来', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-online', 'text': '体验达', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-online', 'text': '摩院推', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-online', 'text': '出的语', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-online', 'text': '音识', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-online', 'text': '别模型', 'wav_name': 'default', 'is_final': True}

wav_path1: FunASR-main\runtime\funasr_api\SSB00050002.wav
connect to url ws://127.0.0.1:10095
send json {"mode": "2pass", "chunk_size": [0, 10, 5], "encoder_chunk_look_back": 4, "decoder_chunk_look_back": 1, "chunk_interval": 10, "wav_name": "default", "is_speaking": true}
text {'mode': '2pass-online', 'text': '群', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-online', 'text': '你', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-offline', 'text': '春日,天寒。', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-online', 'text': '嗯', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-offline', 'text': '电话。', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-online', 'text': '嗯', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-online', 'text': '嗯', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-offline', 'text': '我的勇器。', 'wav_name': 'default', 'is_final': True}
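To make the two runs easier to compare, the per-chunk messages can be joined into one transcript per mode. The following is a minimal sketch, not part of the FunASR client: the helper names `parse_text_lines` and `join_by_mode` are hypothetical, and it assumes the log lines have exactly the `text {...}` shape shown above. Reading `2pass-online` as the streaming partial results and `2pass-offline` as the second-pass results is inferred from the mode names.

```python
import ast
from collections import defaultdict


def parse_text_lines(log):
    """Parse lines of the form "text {...}" (as printed above) into dicts."""
    messages = []
    for line in log.splitlines():
        line = line.strip()
        if line.startswith("text "):
            # The printed payload is a Python dict repr (single quotes, True),
            # so ast.literal_eval is used rather than json.loads.
            messages.append(ast.literal_eval(line[len("text "):]))
    return messages


def join_by_mode(messages):
    """Concatenate the 'text' fields separately for each 'mode' value."""
    joined = defaultdict(str)
    for msg in messages:
        joined[msg["mode"]] += msg["text"]
    return dict(joined)


# Messages copied from the SSB00050002.wav run above.
sample_log = """\
text {'mode': '2pass-online', 'text': '群', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-online', 'text': '你', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-offline', 'text': '春日,天寒。', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-online', 'text': '嗯', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-offline', 'text': '电话。', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-online', 'text': '嗯', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-online', 'text': '嗯', 'wav_name': 'default', 'is_final': True}
text {'mode': '2pass-offline', 'text': '我的勇器。', 'wav_name': 'default', 'is_final': True}
"""

result = join_by_mode(parse_text_lines(sample_log))
for mode, text in result.items():
    print(mode, text)
```

Applied to the asr_example.wav run, the same helpers would yield only a `2pass-online` entry, since no `2pass-offline` message appears in that log.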
What have you tried?
What's your environment?
- OS (e.g., Linux):
- FunASR Version (e.g., 1.0.0):
- ModelScope Version (e.g., 1.11.0):
- PyTorch Version (e.g., 2.0.0):
- How you installed funasr (pip, source):
- Python version:
- GPU (e.g., V100M32)
- CUDA/cuDNN version (e.g., cuda11.7):
- Docker version (e.g., funasr-runtime-sdk-cpu-0.4.1)
- Any other relevant information: