datacuration topic

List datacuration repositories

scholia

212
Stars
77
Forks
Watchers

Wikidata-based scholarly profiles

library

180
Stars
6
Forks
Watchers

70+ CLI tools to build, browse, and blend your media library. An index for your archive.

data-prep-kit

235
Stars
122
Forks
Watchers

Open source project for data preparation of LLM application builders

NeMo-Curator

542
Stars
71
Forks
Watchers

Scalable data pre processing and curation toolkit for LLMs