tortoise-tts icon indicating copy to clipboard operation
tortoise-tts copied to clipboard

Is additional training possible with tortoise-tts?

Open kbuwel opened this issue 1 year ago • 4 comments

I am very interested in this project. I think it's an interesting project that can create tts with a 10-second voice sample. I also think it's good to support multiple languages. However, there is a pronunciation problem. I've thought for a long time that if only the pronunciation is corrected, it could be a very good tool. Tortoise-tts read me a document written in Korean, but it's not very good to listen to in English pronunciation. I had the thought that I wanted to add Korean pronunciation to tortoise-tts. I want to know if additional training is possible and if there is a license cost, I am willing to pay if it is a reasonable cost. I would appreciate it if you could tell me how to add training to tortoise-tts.

kbuwel avatar Feb 26 '24 04:02 kbuwel

@kbuwel it's an open source project , no licensing required, you need to train a model from scratch for a new language with lots of data > 10k hours more is better, train a bpe tokenizer for Korean language.

manmay-nakhashi avatar Feb 26 '24 04:02 manmay-nakhashi

Is it possible to create tts with Korean pronunciation using just a tokenizer? Don't need audio data or large-text transcripts?

kbuwel avatar Feb 26 '24 04:02 kbuwel

@kbuwel no

manmay-nakhashi avatar Feb 26 '24 04:02 manmay-nakhashi

I need documentation to train the model for Tamil and Indian English with tortoise-tts. No proper information to build from scratch

dhamaraiselvi avatar May 09 '24 12:05 dhamaraiselvi