MeloTTS icon indicating copy to clipboard operation
MeloTTS copied to clipboard

Adding Turkish language support

Open g-hano opened this issue 1 year ago • 7 comments

I created turkish.py and turkish_bert.py to support Turkish language. Used dbmdz/bert-base-turkish-cased as tokenizer

g-hano avatar Dec 23 '24 12:12 g-hano

@g-hano Did you train with Turkish data?

kadirnar avatar Dec 27 '24 11:12 kadirnar

@g-hano Did you train with Turkish data?

Yes I did, had to update many lines but finally was able to train on 4 T4s. Extra training on 46k audio samples made the model produce clearer outputs.

g-hano avatar Dec 29 '24 16:12 g-hano

Hello, thank you for supporting the Turkish TTS model, first of all. When I reviewed your code, I noticed that you convert text to lowercase during normalization. Unfortunately, for Turkish, uppercase "I" is converted to lowercase "i," which is incorrect. As a solution, the following change is needed: text.replace("I", "ı").lower(). Since you are using lowercase text, I recommend trying ytu-ce-cosmos/turkish-base-bert-uncased.

tosunozgun avatar Dec 29 '24 21:12 tosunozgun

Hello, thank you for supporting the Turkish TTS model, first of all. When I reviewed your code, I noticed that you convert text to lowercase during normalization. Unfortunately, for Turkish, uppercase "I" is converted to lowercase "i," which is incorrect. As a solution, the following change is needed: text.replace("I", "ı").lower(). Since you are using lowercase text, I recommend trying ytu-ce-cosmos/turkish-base-bert-uncased.

Thank you for your recommendations, I updated the code.

g-hano avatar Dec 30 '24 05:12 g-hano

Thanks for the great contribution @g-hano . I'm trying to test it however I couldn't find a way to test. Is there any way to test something like that? melo "Merhaba ben Türkçe konuşabiliyorum." output.wav --language TR

furkeen avatar Jan 20 '25 05:01 furkeen

I am developing the MeloPlus library to add more features to the MeloTts library. I added Thai and Turkish language support.

@furkeen You should find a Turkish model trained with the melotts model.

kadirnar avatar Jan 22 '25 23:01 kadirnar

Any huggingface space for testing? or sample output?

karayakar avatar Aug 05 '25 00:08 karayakar