datasets topic

List datasets repositories

pytorch-cpp

1.9k
Stars
254
Forks
Watchers

C++ Implementation of PyTorch Tutorials for Everyone

audino

1.0k
Stars
121
Forks
Watchers

Open source audio annotation tool for humans

deeplake

8.1k
Stars
615
Forks
Watchers

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop....

datasets

4.2k
Stars
1.5k
Forks
Watchers

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...

dffml

243
Stars
135
Forks
Watchers

The easiest way to use Machine Learning. Mix and match underlying ML libraries and data set sources. Generate new datasets or modify existing ones with ease.

loghub

1.6k
Stars
573
Forks
Watchers

A large collection of system log datasets for AI-driven log analytics [ISSRE'23]

CLUEDatasetSearch

3.9k
Stars
597
Forks
Watchers

搜索所有中文NLP数据集,附常用英文NLP数据集

Chinese-NLP-Corpus

854
Stars
207
Forks
Watchers

Collections of Chinese NLP corpus

awesome-public-datasets

65.1k
Stars
10.3k
Forks
Watchers

A topic-centric list of HQ open datasets.