language-model topic
belgpt2
🇧🇪 BelGPT-2: the first GPT model pretrained on French text.
trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
mead-baseline
Deep-Learning Model Exploration and Development for NLP
Deep-NLP-Resources
Curated list of all NLP Resources
Language-games
Dead simple games made with word vectors.
gLM
A GPU language model based on B-tree-backed tries.
electra_pytorch
Pretrain and fine-tune ELECTRA with fastai and Hugging Face. (Results of the paper replicated!)
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
CLUE
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)