langdetect icon indicating copy to clipboard operation
langdetect copied to clipboard

Inaccurate predictions for basic english words

Open grestonian opened this issue 4 years ago • 1 comments

library is unable to detect language for basic english words and hence generates poor inaccurate results as depicted below. detect("sunday") => 'id' | whereas clearly 'sunday' in indonesian is minggu detect("monday") => 'tr' | whereas 'monday' in turkish is 'pazartesi' and surprisingly, detect('pazartesi') => 'es'

Infact, langdetect.deteect_langs("sunday") outputs confidences for 'tr' and 'id', and no mention of english whatsoever. same goes for months, and other basic english words, eg detect("good") => 'so

grestonian avatar Mar 29 '20 20:03 grestonian

"son", "song",...

nightfuryyy avatar Aug 03 '20 03:08 nightfuryyy