Preserve isn't preserving the original words in multilingual content (UTF-8 file).

Open • mxav1111 opened this issue 1 year ago • 0 comments

Hi there. It seems that when a document contains both English words (with accents) and Hindi words (written in proper Hindi script), using the preserve parameter does not leave the Hindi words alone: they are converted to English (Latin) letters. What I expected is that only the accented English words are converted and everything else is left as it is (preserved). Accents are removed from the English words correctly, but the Hindi words are altered as well.

I hope my understanding of the preserve parameter is correct. If it is not, I apologize. If you have any suggestions for working around this, please share them.
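To make the expected behavior concrete, here is a minimal sketch of it. It does not use unidecode at all: it is a stand-in built on Python's standard `unicodedata` module, and the function name `strip_latin_accents` is hypothetical. It removes combining accents only from Latin-script characters and copies everything else, including Hindi text, through unchanged. My assumption, not confirmed anywhere in this issue, is that the preserve option only keeps characters for which the library has no transliteration at all, which would explain why Hindi is still converted.

```python
import unicodedata


def strip_latin_accents(text: str) -> str:
    """Remove accents from Latin letters only; leave other scripts untouched.

    Hypothetical helper for illustration, not part of unidecode.
    """
    out = []
    prev_latin = False  # True while we are inside a Latin base + marks sequence
    for ch in text:
        if unicodedata.combining(ch) and prev_latin:
            # A combining mark attached to a Latin letter (decomposed input): drop it.
            continue
        if unicodedata.name(ch, "").startswith("LATIN"):
            # Decompose the letter and keep only the non-combining parts.
            decomposed = unicodedata.normalize("NFD", ch)
            out.append("".join(c for c in decomposed if not unicodedata.combining(c)))
            prev_latin = True
        else:
            # Any other character (Devanagari, spaces, digits, ...) is copied as-is.
            out.append(ch)
            prev_latin = False
    return "".join(out)


print(strip_latin_accents("café नमस्ते résumé"))  # prints: cafe नमस्ते resume
```

Note that this only strips accents that Unicode can decompose. Letters such as `ø` or `ß` have no canonical decomposition and pass through unchanged, so it is narrower than a full transliteration.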

Thanks for your help.

mxav1111, Jun 22 '24 15:06