language-classification topic
lingua-go
The most accurate natural language detection library for Go, suitable for short text and mixed-language text
lingua-rs
The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
lingua
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
lingua-py
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
RNN-Language-Classifier
A language classifier powered by a recurrent neural network, implemented from scratch in Python without AI libraries.
ungoliant
🕷️ The pipeline for the OSCAR corpus
goclassy
An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.
hyperdimensional-computing
Hyperdimensional computing explained and demonstrated
GlotLID
GlotLID: Language Identification with Support for More Than 2000 Labels -- EMNLP 2023