language-classification topic

List language-classification repositories

lingua-go

1.1k
Stars
64
Forks
Watchers

The most accurate natural language detection library for Go, suitable for short text and mixed-language text

lingua-rs

838
Stars
35
Forks
Watchers

The most accurate natural language detection library for Rust, suitable for short text and mixed-language text

lingua

663
Stars
60
Forks
Watchers

The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike

lingua-py

961
Stars
42
Forks
Watchers

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

RNN-Language-Classifier

55
Stars
12
Forks
Watchers

A Language Classifier powered by Recurrent Neural Network implemented in Python without AI libraries. AI from scratch.

ungoliant

152
Stars
14
Forks
Watchers

:spider: The pipeline for the OSCAR corpus

goclassy

85
Stars
6
Forks
Watchers

An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.

GlotLID

86
Stars
7
Forks
Watchers

GlotLID: Language Identification with Support for More Than 2000 Labels -- EMNLP 2023