language-model topic

List language-model repositories

AutoenCODE

60
Stars
16
Forks
Watchers

AutoenCODE is a Deep Learning infrastructure that allows to encode source code fragments into vector representations, which can be used to learn similarities.

nucliadb

587
Stars
45
Forks
Watchers

NucliaDB, The AI Search database for RAG

gdc

116
Stars
23
Forks
Watchers

Code accompanying our papers on the "Generative Distributional Control" framework

TextRL

539
Stars
61
Forks
Watchers

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)

KenLM-training

111
Stars
21
Forks
Watchers

Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2

COCO-LM

120
Stars
12
Forks
Watchers

[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining

CoLAKE

114
Stars
17
Forks
Watchers

COLING'2020: CoLAKE: Contextualized Language and Knowledge Embedding

PhoNLP

131
Stars
18
Forks
Watchers

PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)

Romanian-Transformers

89
Stars
6
Forks
Watchers

This repo is the home of Romanian Transformers.