aksharamukha icon indicating copy to clipboard operation
aksharamukha copied to clipboard

Word final ih, īh, ēh, aih, uh, and ṃ/ṁh rules not followed for Urdu/Shahmukhi

Open bgo-eiu opened this issue 2 years ago • 0 comments

This is a somewhat counterintuitive rule of the orthography, but it is a consistent one. In the Perso-Arabic based writing systems ہ doubles as the consonant h and as a vowel where it occurs word finally. (The main reason this occurs is to distinguish words that look the same without diacritics. patā is پتہ while pattā is پتا. Determining where ہ is a vowel is language specific and out of scope for this converter however.)

Due to this, a double ہہ is used to represent h as a consonant in certain word final sequences to avoid ambiguity. These are each of the sequences which may be implemented:

  • ih (Punjabi, “to say”) ISO 15969: kih Gurmukhi: ਕਿਹ Shahmukhi: کِہہ Explanation: Currently this produces کِہَ which would be read as ki or ਕਿ which is a different word.

  • īh (Punjabi, “twenty”) ISO 15969: vīh Gurmukhi: ਵੀਹ Shahmukhi: وِیہہ Explanation: Currently this produces وِیہ٘ which would be read as vīā or ਵੀਆ

  • ēh (Punjabi, “yoghurt”) ISO 15959: dēh Gurmukhi: ਦੇਹ Shahmukhi: دیہہ Explanation: Currently produces دیہَ which would be diā or ਦਿਆ

  • aih (Punjabi, “protection”) ISO 15969: śaih Gurmukhi: ਸ਼ੈਹ Shahmukhi: شَیہہ Explanation: Currently produces شَیہَ which is śayā or ਸ਼ਯਾ

  • uh (Punjabi, “multiple”) ISO 15969: buh Gurmukhi: ਬੁਹ Shahmukhi: بُہہ Explanation: Currently produces بُہ which would be bu or ਬੁ

  • ṃ/ṁh (Punjabi, “arm”) ISO 15969: bāṃh Gurmukhi: ਬਾਂਹ Shahmukhi: بانہہ

(Punjabi, “mouth, face”) ISO 15969: mūṁh Gurmukhi: ਮੂੰਹ Shahmukhi: مُونہہ

Explanation: This applies to any word final homorganic nasal before h.


Note that these rules do not carry over to word final ūh or oh. ਰੂਹ = رُوہ and ਰੋਹ = روہ

bgo-eiu avatar Jan 22 '23 16:01 bgo-eiu