langdetect topic
List
langdetect repositories
spacy-langdetect
92
Stars
6
Forks
Watchers
A fully customisable language detection pipeline for spaCy
zabanshenas
17
Stars
1
Forks
Watchers
Zabanshenas is a solution for identifying the most likely language of a piece of written text. Demo (👇 )
go-pkg-spider
212
Stars
9
Forks
Watchers
一个 Golang 实现的相对智能、无需规则维护的通用新闻网站数据提取工具库。含域名探测、网页编码语种识别、网页链接分类提取、网页新闻要素抽取以及新闻正文抽取等组件。
split-lang
27
Stars
3
Forks
Watchers
✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and langua