langid.py
langid.py copied to clipboard
Dead Languages
Hi, I just stumbled over langid and then, when trying how suitable it'd be for my purposes, stumbled over this:
❯ echo 'در' | langid -l ar,fa,ota
Traceback (most recent call last):
File "/home/jrs/.local/bin/langid", line 8, in <module>
sys.exit(main())
File "/home/jrs/.local/lib/python3.9/site-packages/langid/langid.py", line 504, in main
identifier.set_languages(langs)
File "/home/jrs/.local/lib/python3.9/site-packages/langid/langid.py", line 245, in set_languages
raise ValueError("Unknown language code %s" % lang)
ValueError: Unknown language code ota
What is the project's policy towards ISO-693-2 (as opposed to ISO-693-1 only?). Any chance there'll be support for three-letter codes such as the ota
of this example at some point? Or at least a trace-less error message? :-)