data-infrastructure topic

List data-infrastructure repositories

spark-json-schema

81
Stars
43
Forks
Watchers

JSON schema parser for Apache Spark

darty

22
Stars
3
Forks
Watchers

Data dependency manager

data-machinelearning-the-boring-way

53
Stars
10
Forks
Watchers

Build & Learn Data Engineering,Machine Learning over Kubernetes. No Shortcut approach.

kanadi

29
Stars
19
Forks
Watchers

Kanadi is a Nakadi client for Scala

stream-zip

110
Stars
9
Forks
Watchers

Python function to construct a ZIP archive on the fly

sqlite-s3vfs

121
Stars
9
Forks
Watchers

Python writable virtual filesystem for SQLite on S3

mobius3

47
Stars
3
Forks
Watchers

Continuously sync folder to S3, using inotify under the hood

fargatespawner

44
Stars
21
Forks
Watchers

Spawns JupyterHub single user servers in Docker containers running in AWS Fargate

data-workspace-frontend

43
Stars
23
Forks
Watchers

An open source data analysis platform with features for users with a range of technical skills