DataOps topic
DataOps is an automated, process-oriented methodology, used by analytic and data teams, to improve the quality and reduce the cycle time of data analytics. While DataOps began as a set of best practices, it has now matured to become a new and independent approach to data analytics. DataOps applies to the entire data lifecycle from data preparation to reporting, and recognizes the interconnected nature of the data analytics team and information technology operations.
tenzir
Tenzir is the data pipeline engine for security teams.
awesome-dataops
Awesome list of dataops products, open source and resources
aws-glue-cdk-cicd
Build, Test and Deploy ETL solutions using AWS Glue and AWS CDK based CI/CD pipelines
awesome-ml-experiment-management
A curated list of awesome open source tools and commercial products for ML Experiment Tracking and Management 🚀
awesome-ml-monitoring
A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profiling data 🚀
awesome-data-management
A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀
pipelime-python
A swiss army knife for data processing!
hok-helm
HokStack - Run Hadoop Stack on Kubernetes
dataops-platform-airflow-dbt
Build DataOps platform with Apache Airflow and dbt on AWS
space
Unified storage framework for the entire machine learning lifecycle