Cornelius Justin Satryo Hadi
Hi @ethan-digi, have you solved this issue?
Hi @MiXaiLL76, did you find a way to train new languages correctly? Thanks.
Hi @aluminumbox, do you think it's better to train CosyVoice from scratch or to just finetune the CosyVoice-300M base model if I want to train on a new language? Also, should...
@aluminumbox, should we also change the `language` parameter in the get_tokenizer config?
```
get_tokenizer: !name:whisper.tokenizer.get_tokenizer # change to !name:cosyvoice.tokenizer.tokenizer.get_tokenizer if you want to train with CosyVoice-300M-25Hz recipe
    multilingual: True
    num_languages:...
```
@aluminumbox, are 800-900 hours of audio enough to train a new language, or should I train from scratch?
> no need to train from scratch, I think at least 5k+ hours is suitable to cover a new language

Should I finetune the LLM and the flow, or just...
Hi @clement-pages, are there any alternatives to using SimAMResNet as the embedding model for pyannote speaker diarization?