Expedia Group Open Source
Expedia Group Open Source
flyte
Flyte binds together the tools you use into easily defined, automated workflows
circus-train
Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.
jarviz
Jarviz is dependency analysis and visualization tool designed for Java applications
kubernetes-sidecar-injector
Kuberbetes mutating webhook that injects a sidecar container to a pod
beeju
JUnit integration for testing the Apache Hive Metastore and HiveServer2 Thrift APIs
apiary
Apiary provides modules which can be combined to create a federated cloud data lake
apiary-data-lake
Terraform scripts for deploying Apiary Data Lake
avro-compatibility
A user friendly API for checking for and reporting on Avro schema incompatibilities.
beekeeper
Service for automatically managing and cleaning up unreferenced data
datasqueeze
Hadoop utility to compact small files