language-model topic

List language-model repositories

nlp_chinese_corpus

9.2k
Stars
1.5k
Forks
Watchers

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

haystack

16.9k
Stars
1.8k
Forks
130
Watchers

:mag: AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your d...

zeroth

345
Stars
122
Forks
Watchers

Kaldi-based Korean ASR (한국어 음성인식) open-source project

bert_language_understanding

959
Stars
211
Forks
Watchers

Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN

bluebert

538
Stars
76
Forks
Watchers

BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).

AzureML-BERT

388
Stars
126
Forks
Watchers

End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service

RobBERT

193
Stars
28
Forks
Watchers

A Dutch RoBERTa-based language model

tokenizers

8.6k
Stars
737
Forks
Watchers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

keras-bert

2.4k
Stars
510
Forks
Watchers

Implementation of BERT that could load official pre-trained models for feature extraction and prediction

spacy-transformers

1.3k
Stars
161
Forks
Watchers

🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy