epitran icon indicating copy to clipboard operation
epitran copied to clipboard

Lao wrong transliteration

Open aleray opened this issue 11 months ago • 2 comments

Hi,

your project looks wonderful. I came across it while looking for Lao romanization. I think there is a mistake in the following example...

import epitran
epi = epitran.Epitran('lao-Laoo')
res = epi.transliterate(u'ຂ້ອຍມັກກາເຟ')
print(res)
'kʰɔːɲmakkaːfeː'

rom = epitran.reromanize.ReRomanizer('lao-Laoo', 'anglocentric')
print(rom.reromanize(res))
'koonymakkaafee'

the characters "ອ" and "ຍ" in "ຂ້ອຍ" are consonants, but together — and in this case — they form the (complex) vowel/diphtong "ອຍ". See: https://r12a.github.io/scripts/laoo/lo#diphthongsz.

I think the "ຍ" got treated here as the consonants "ny", but should not. I'm not a specialist so please forgive me if I'm wrong.

Thanks

aleray avatar Nov 02 '24 10:11 aleray