epitran
epitran copied to clipboard
Lao wrong transliteration
Hi,
your project looks wonderful. I came across it while looking for Lao romanization. I think there is a mistake in the following example...
import epitran
epi = epitran.Epitran('lao-Laoo')
res = epi.transliterate(u'ຂ້ອຍມັກກາເຟ')
print(res)
'kʰɔːɲmakkaːfeː'
rom = epitran.reromanize.ReRomanizer('lao-Laoo', 'anglocentric')
print(rom.reromanize(res))
'koonymakkaafee'
the characters "ອ" and "ຍ" in "ຂ້ອຍ" are consonants, but together — and in this case — they form the (complex) vowel/diphtong "ອຍ". See: https://r12a.github.io/scripts/laoo/lo#diphthongsz.
I think the "ຍ" got treated here as the consonants "ny", but should not. I'm not a specialist so please forgive me if I'm wrong.
Thanks