sherpa icon indicating copy to clipboard operation
sherpa copied to clipboard

High Word Error Rates % in Large Whisper V3

Open zhao-lun opened this issue 1 year ago • 1 comments
trafficstars

model_used=large_whisperv3 Hardware: A100 Dataset: aishell

client cmd

num_task=16
python3 client.py     --server-addr localhost   \
  --model-name whisper     --num-tasks $num_task   \
 --whisper-prompt "<|startoftranscript|><|zh|><|transcribe|><|notimestamps|>"    \
 --manifest-dir /sample_dataset/aishell1_test/ 

server is built using the sample dockerfile

output: RTF: 0.0092 total_duration: 32590.000 seconds (9.05 hours) processing time: 299.156 seconds (0.08 hours)

%WER = 53.34 Errors: 55 insertions, 0 deletions, 3773 substitutions, over 7176 reference words (3403 correct)

Hi, i followed setup instructions, and it ran without any issues. However, I noticed an abnormal WER percentage. Is this normal?

zhao-lun avatar Mar 07 '24 09:03 zhao-lun

@zhao-lun https://github.com/k2-fsa/icefall/blob/master/egs/aishell/ASR/whisper/decode.py#L286-L288, check this to do normalize before computing metrics.

If you have some free time, feel free to make a PR to triton-asr-client/client.py.

yuekaizhang avatar Mar 11 '24 06:03 yuekaizhang