awesome-opensource-data-engineering icon indicating copy to clipboard operation
awesome-opensource-data-engineering copied to clipboard

An Awesome List of Open-Source Data Engineering Projects

Results 20 awesome-opensource-data-engineering issues
Sort by recently updated
recently updated
newest added

Added [Datumaro Dataset Management Framework](https://github.com/openvinotoolkit/datumaro) link. It can fit in several categories, please comment in which category(-ies?) it should be put.

I suggest adding the following: 1. Meltano by GitLab 2. dbt by Fishtown Analytics 3. Singer by StitchData (Talend) 4. Airbyte 5. ScyllaDB

I have added dbt by Fishtown Analytics. dbt is a SQL-first transformation workflow that lets teams quickly and collaboratively deploy analytics code following software engineering best practices like modularity, portability,...

Found this list through the AIDA user group, and have found a lot of value from these free books

its retired and no longer active, its good idea to remove it

Added workflow management tool mage.ai fixes #39

Adding info on StarRocks.