awesome-opensource-data-engineering
awesome-opensource-data-engineering copied to clipboard
An Awesome List of Open-Source Data Engineering Projects
Added [Datumaro Dataset Management Framework](https://github.com/openvinotoolkit/datumaro) link. It can fit in several categories, please comment in which category(-ies?) it should be put.
I suggest adding the following: 1. Meltano by GitLab 2. dbt by Fishtown Analytics 3. Singer by StitchData (Talend) 4. Airbyte 5. ScyllaDB
I have added dbt by Fishtown Analytics. dbt is a SQL-first transformation workflow that lets teams quickly and collaboratively deploy analytics code following software engineering best practices like modularity, portability,...
Found this list through the AIDA user group, and have found a lot of value from these free books
its retired and no longer active, its good idea to remove it
Added workflow management tool mage.ai fixes #39
Adding info on StarRocks.