text-data topic

List text-data repositories

texar

2.4k
Stars
371
Forks
Watchers

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

cotk

128
Stars
27
Forks
Watchers

Conversational Toolkit. An Open-Source Toolkit for Fast Development and Fair Evaluation of Text Generation

DialoGPT

2.3k
Stars
342
Forks
Watchers

Large-scale pretraining for dialogue

texar-pytorch

744
Stars
119
Forks
Watchers

Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/

GODEL

839
Stars
110
Forks
Watchers

Large-scale pretrained models for goal-directed dialog

forte

236
Stars
60
Forks
Watchers

Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/

redditcleaner

78
Stars
2
Forks
Watchers

Cleans Reddit Text Data :scroll: :broom:

textreadr

73
Stars
5
Forks
Watchers

Tools to uniformly read in text data including semi-structured transcripts

rake_new2

29
Stars
20
Forks
Watchers

A Python library that enables smooth keyword extraction from any text using the RAKE(Rapid Automatic Keyword Extraction) algorithm.

wordmap

24
Stars
5
Forks
Watchers

Visualize large text collections with WebGL