Liam Doherty
Liam Doherty
Thanks for raising this question, but I'm not sure I quite understand the issue. Heterophones are explicitly included in the data structure, as described in the Readme: > Where multiple...
@sancarn The more accents/dialects/speech varieties the better! The current approach is to separate these into different dictionaries so that the list for each language variant is, internally speaking, as phonemically...
@nicolasdevops Yes, the last release was quite some time ago and newer languages have not been included. I can generate a new release which should add all the current languages....
It would be great to add Italian -- do you know of any sources for such a dictionary? If there isn't something already available under an open license, it might...
@loretoparisi Fantastic! Thanks for finding all this info. I wasn't aware that there were CMU dictionaries for other languages. The transcription format is indeed a little odd, but luckily it's...
@loretoparisi That's amazing! Sounds like it could be a much better approach, and it will be interesting to see how accurate the results are on the Hunspell list. In the...
@loretoparisi Just checking in... Have you had any progress with this so far? It would be great to add Italian to the database once it's ready! :smile:
@loretoparisi Excellent, thanks! :+1: I have this version from before but will wait for the update to convert it and add to the database.
@TasseDeCafe This is fantastic! Thanks so much. Just a few questions before merging: 1. Is there a license for this data? 2. I assume you generated the IPA automatically --...
@TasseDeCafe Thanks very much for sharing this! :+1: It would be great to add data for Vietnamese, and it looks like the links you found have everything we would need...