langdetect
langdetect copied to clipboard
Port of Google's language-detection library to Python.
I am willing to add Corsican language to the library. But In order to generate a language profile from plain text, I wonder if there exists a python port for...
"I love you" is being detected as sk language Is this normal? Are there any sanity tests to run?
Since `langdetect.__version__` is not defined how to do?
A lock is used to avoid a race condition on the global variable _factory: as soon as the line '_factory = DetectorFactory()' was executed, the other threads jumped the condition...
Should we take the init_factory() outside the detect() so that if we are using this function on dataframes or in loops, it won't have to load the 55 language files...
Does it support Brazilian Portuguese (pt-br)?
scores = langdetect.detect_langs(text) File "../venv/lib/python3.7/site-packages/langdetect/detector_factory.py", line 116, in detect_langs detector.append(text) File "../venv/lib/python3.7/site-packages/langdetect/detector.py", line 106, in append text = NGram.normalize_vi(text) File "../venv/lib/python3.7/site-packages/langdetect/utils/ngram.py", line 108, in normalize_vi return cls.ALPHABET_WITH_DMARK.sub(repl, text) File "../venv/lib/python3.7/site-packages/langdetect/utils/ngram.py",...