langid.py
Detection error when processing full-width letters
When given full-width letters, langid.classify returns Chinese ("zh") as the result:
>>> import langid
>>> langid.classify('ＡＢＣ')
('zh', 0.9668056948707975)
Here, ＡＢＣ consists of full-width (double-byte) characters, not ASCII letters.
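A possible workaround, sketched here as an assumption and not as a fix inside langid.py, is to fold full-width characters to their ordinary forms with Unicode NFKC normalization before classifying. The helper name `to_halfwidth` is hypothetical; only the standard-library `unicodedata.normalize` call is used.

```python
import unicodedata


def to_halfwidth(text: str) -> str:
    """Fold full-width compatibility characters to their ordinary forms.

    Hypothetical helper for illustration; it is not part of langid.py.
    """
    return unicodedata.normalize("NFKC", text)


fullwidth = "\uff21\uff22\uff23"  # full-width A, B, C (U+FF21..U+FF23)
print(to_halfwidth(fullwidth))  # ABC
```

The normalized string could then be passed to langid.classify in place of the raw input. Note that NFKC also rewrites other compatibility characters (for example ligatures), so it may not suit every input.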