langdetect topic

List langdetect repositories

spacy-langdetect

92
Stars
6
Forks
Watchers

A fully customisable language detection pipeline for spaCy

zabanshenas

17
Stars
1
Forks
Watchers

Zabanshenas is a solution for identifying the most likely language of a piece of written text. Demo (👇 )

go-pkg-spider

212
Stars
9
Forks
Watchers

一个 Golang 实现的相对智能、无需规则维护的通用新闻网站数据提取工具库。含域名探测、网页编码语种识别、网页链接分类提取、网页新闻要素抽取以及新闻正文抽取等组件。

split-lang

27
Stars
3
Forks
Watchers

✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and langua