data-versioning topic
wandb
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
lakeFS
lakeFS - Data version control for your data lake | Git for data
dolt
Dolt – Git for Data
RecallGraph
A versioning data store for time-variant graph data.
quilt
Quilt is a data mesh for connecting people with actionable data
kart
Distributed version-control for geospatial and tabular data
data-versioning
Collecting thoughts about data versioning
gittargets
Data version control for reproducible analysis pipelines in R with {targets}.
sdk
Metadata store for Production ML
awesome-open-data-centric-ai
Curated list of open source tooling for data-centric AI on unstructured data.