polyglot icon indicating copy to clipboard operation
polyglot copied to clipboard

Transliteration error

Open matanox opened this issue 6 years ago • 3 comments

Code like this:

blob = u"""גאבאפנטין"""
text = Text(blob)

for x in text.transliterate("en"):
  print(x)

ValueError: Package 'transliteration2.iw' not found in index

If we change the input string to one in e.g. Arabic, a slightly different error message:

ValueError: Package 'transliteration2.sd' not found in index

but there seems to be no such packages to download, or it is unclear how to obtain them. Other transliteration2 packages download just fine and do facilitate transliteration from English to the downloaded target language. Is transliteration to English supported?

matanox avatar May 31 '18 15:05 matanox

Well, based on the quoted research article, I think the answer may be no. On the other hand, the nature of the errors may hint that they are coming from incorrect source language identification.

matanox avatar May 31 '18 15:05 matanox

https://polyglot.readthedocs.io/en/latest/Download.html download this model for me makes it work!

bhx98 avatar Jan 08 '22 09:01 bhx98

I downloaded the .he file and manually changed the filenames to .iw - worked with no issue

marksworn avatar Apr 02 '23 15:04 marksworn