lingua
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Related to #119. Currently, Lingua uses `ForkJoinPool.commonPool()` for model loading and language detection. However, maybe it would be useful to allow users to specify their own [`Executor`](https://docs.oracle.com/en/java/javase/17/docs/api/java.base/java/util/concurrent/Executor.html), for example with...
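To make the request above concrete, here is a minimal sketch of what a user-supplied `Executor` could look like. It is not Lingua's API: `ModelLoader`, `loadAll`, and the `loadOne` function are hypothetical names introduced only for illustration. The sketch assumes model loading can be expressed as one independent task per language, and it falls back to `ForkJoinPool.commonPool()` when no executor is given.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.Executor;
import java.util.concurrent.ForkJoinPool;
import java.util.function.Function;

// Hypothetical helper, not part of Lingua: runs one loading task per language
// on whichever Executor the caller provides.
final class ModelLoader {
    private final Executor executor;

    // Default: keep the current behaviour and use the common pool.
    ModelLoader() {
        this(ForkJoinPool.commonPool());
    }

    // Alternative: let the caller decide where the work runs.
    ModelLoader(Executor executor) {
        this.executor = executor;
    }

    // Submits one task per language, then waits for all of them to finish.
    <M> Map<String, M> loadAll(List<String> languages, Function<String, M> loadOne) {
        Map<String, CompletableFuture<M>> futures = new LinkedHashMap<>();
        for (String language : languages) {
            futures.put(language,
                CompletableFuture.supplyAsync(() -> loadOne.apply(language), executor));
        }
        Map<String, M> models = new LinkedHashMap<>();
        futures.forEach((language, future) -> models.put(language, future.join()));
        return models;
    }
}
```

A caller could then pass, for example, `Executors.newFixedThreadPool(2)` and shut that pool down after loading, which is not possible with the shared common pool.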
With the recent changes of using [`ForkJoinPool.commonPool()`](https://docs.oracle.com/en/java/javase/17/docs/api/java.base/java/util/concurrent/ForkJoinPool.html#commonPool()) (and possibly other changes as well), Lingua now requires at least JDK 8. I am not sure if that was intentional; the README...
Detection of long texts (or usage of `withLowAccuracyMode()`) only uses a single worker thread for language detection. The reason for this is that a [work task per ngram length](https://github.com/pemistahl/lingua/blob/682da180106fa63dacfb945c473c06ba7d73b254/src/main/kotlin/com/github/pemistahl/lingua/api/LanguageDetector.kt#L160) is...
Hello, this is a request to make this great project available to a bigger community. It would be great if you could also make this project available on NuGet. Many...
Hello! So I've been trying out the lingua library and it's awesome. Was wondering if it's possible to add other classification metrics such as `Precision`, `Recall`, `Specificity` and `F1` in...
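For reference, the metrics named in this request can be computed per language from the four confusion-matrix counts. The sketch below uses the standard definitions; `ClassificationMetrics` and its method names are hypothetical and do not describe how Lingua's accuracy reports are generated. Returning `0.0` when a denominator is zero is an assumption made here to avoid division by zero.

```java
// Hypothetical helper, not part of Lingua. For one target language:
// tp = texts correctly classified as that language,
// fp = texts of other languages wrongly classified as it,
// fn = texts of that language classified as something else,
// tn = texts of other languages not classified as it.
final class ClassificationMetrics {
    private ClassificationMetrics() {}

    // Precision = tp / (tp + fp)
    static double precision(long tp, long fp) {
        return tp + fp == 0 ? 0.0 : (double) tp / (tp + fp);
    }

    // Recall (sensitivity) = tp / (tp + fn)
    static double recall(long tp, long fn) {
        return tp + fn == 0 ? 0.0 : (double) tp / (tp + fn);
    }

    // Specificity = tn / (tn + fp)
    static double specificity(long tn, long fp) {
        return tn + fp == 0 ? 0.0 : (double) tn / (tn + fp);
    }

    // F1 = harmonic mean of precision and recall
    static double f1(long tp, long fp, long fn) {
        double p = precision(tp, fp);
        double r = recall(tp, fn);
        return p + r == 0.0 ? 0.0 : 2 * p * r / (p + r);
    }
}
```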
Currently, for a given input string, only the most likely language is returned. However, if the input contains contiguous sections of multiple languages, it would be desirable to detect all...
Removes `version =` because it seems to reassign the project version instead of configuring the task (the task has no setter for the module info version, it uses the project...
with a unique alphabet
Bumps [it.unimi.dsi:fastutil](https://github.com/vigna/fastutil) from 8.5.12 to 8.5.13. Changelog Sourced from it.unimi.dsi:fastutil's changelog. 8.5.13 Thanks to Chanoch Goldfeder for fixing a number of bugs in ImmutableList. Thanks to Barak Ugav for implementing...