metavoice-src
Will it natively support other languages?
I saw that the README describes the training data as mainly English, and I'm worried the model won't learn the prosody of other languages; Chinese prosody, for example, is very different from English. Will it be possible to learn other languages in the future just by fine-tuning?
Yes, please elaborate more on:
Support for (cross-lingual) voice cloning with finetuning. We have had success with as little as 1 minute training data for Indian speakers.
Could you please share the example code and TTS results for the Indian speakers?
Joining the chorus here! I would like to fine-tune this for Italian speech2speech. Could you release a tutorial on how to do this?
I also want to raise this question; I'm looking for Spanish :)
The system looks fantastic.
Will it be possible to learn other languages in the future just by fine-tuning?
@faceair / anyone else here for that matter - until we release the fine-tuning code, are you open to testing this hypothesis with a LoRA implementation? It would greatly benefit the community. Others can adopt it for different languages too :)
Looking forward to multilingual support!
If there is some info on how to train the LoRA and how to apply it, I'm all for it. In the end, a LoRA can be trained on consumer hardware like a 4090, can't it?
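For a rough sense of why LoRA is usually considered feasible on a single consumer GPU, here is a back-of-the-envelope sketch of trainable parameter counts. The dimensions below (`d_model`, `n_layers`, `rank`, `projections_per_layer`) are hypothetical placeholders, not MetaVoice's actual configuration. The only fact relied on is the standard LoRA construction: a frozen weight of shape `d_out x d_in` gets two small trainable matrices, `A` (`rank x d_in`) and `B` (`d_out x rank`).

```python
def full_params(d_in: int, d_out: int) -> int:
    """Trainable parameters when fine-tuning one dense weight matrix directly."""
    return d_in * d_out


def lora_trainable_params(d_in: int, d_out: int, rank: int) -> int:
    """Trainable parameters added by one LoRA adapter.

    A has shape (rank, d_in) and B has shape (d_out, rank); the base weight
    stays frozen, so only A and B receive gradients.
    """
    return rank * d_in + d_out * rank


# Hypothetical model shape, chosen only for illustration.
d_model = 2048
n_layers = 24
projections_per_layer = 4  # assumed: adapters on the q, k, v and output projections
rank = 8

n_matrices = n_layers * projections_per_layer
full = n_matrices * full_params(d_model, d_model)
lora = n_matrices * lora_trainable_params(d_model, d_model, rank)

print(f"full fine-tune: {full:,} trainable parameters")
print(f"LoRA (r={rank}): {lora:,} trainable parameters")
print(f"ratio: {lora / full:.4%}")
```

This only counts trainable parameters. Actual memory use also depends on the frozen base weights, activations, batch size, and precision, so it is an argument for plausibility rather than a guarantee that training fits on a 4090.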
I can try that.
Sure, would be interested
Please give us a few days, and we will share a reference implementation for this
I've added some initial pointers to this here: https://github.com/metavoiceio/metavoice-src/issues/70#issuecomment-1957337895
I'd be interested in a French LoRA here :)
Hoping for Chinese support.
I'm closing this issue in favour of https://github.com/metavoiceio/metavoice-src/issues/70 where active work seems to be happening!