data-processing-pipelines topic

List data-processing-pipelines repositories

convtools

38
Stars
2
Forks
Watchers

convtools is a specialized Python library for dynamic, declarative data transformations with automatic code generation

NeMo-Curator

542
Stars
71
Forks
Watchers

Scalable data pre processing and curation toolkit for LLMs