How well is Cantonese supported?
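I haven't tested it specifically, but the model should have some ability with Cantonese songs and Cantonese speech. If you're interested, you could run it on a test set. If you can report back the results, thanks in advance~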
It is very bad. I tested on a random sample of 200 items from Common Voice 17 Validated zh-yue, and the CER is basically 100%.
Detailed results are attached as a TSV (wav file, target, hypothesis, RTF on an RTX 4090).
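For anyone who wants to reproduce a CER number from a results TSV like the one described above, here is a minimal sketch. It assumes a tab-separated file with no header row and the columns in the order wav file, target, hypothesis, RTF. The function names (`edit_distance`, `cer`, `cer_from_tsv`) and the whitespace-stripping normalization are my own assumptions, not the script actually used for the numbers in this thread.

```python
import csv


def edit_distance(ref, hyp):
    """Levenshtein distance between two strings, counted in characters."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(pairs):
    """Corpus-level CER: total character edits / total reference characters."""
    total_edits = 0
    total_chars = 0
    for target, hypothesis in pairs:
        # Assumption: whitespace is removed before scoring.
        ref = "".join(target.split())
        hyp = "".join(hypothesis.split())
        total_edits += edit_distance(ref, hyp)
        total_chars += len(ref)
    return total_edits / total_chars if total_chars else 0.0


def cer_from_tsv(path):
    """Read (target, hypothesis) from columns 2 and 3 of a headerless TSV."""
    with open(path, newline="", encoding="utf-8") as f:
        rows = csv.reader(f, delimiter="\t", quoting=csv.QUOTE_NONE)
        return cer((row[1], row[2]) for row in rows if len(row) >= 3)
```

Note that CER can exceed 100% when the hypothesis contains many insertions, and that punctuation or script differences between target and hypothesis (for example Traditional vs. Simplified characters) will inflate the score unless both sides are normalized first.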
I also got poor results on 8 test sets; the CER is around 75~90%. However, the results for the Sichuan accent are much better than for Cantonese.
Are there any plans to open-source the LoRA fine-tuning script, so that the model can be fine-tuned on the Cantonese subset of Common Voice?
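The results are very poor. I tried it on Cantonese weather forecasts and news broadcasts, with no background noise in the audio, and it really doesn't work well.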