Expedia Group Open Source

38 repositories owned by Expedia Group Open Source

flyte

89
Stars
19
Forks
Watchers

Flyte binds together the tools you use into easily defined, automated workflows

circus-train

86
Stars
18
Forks
Watchers

Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.

jarviz

123
Stars
20
Forks
Watchers

Jarviz is a dependency analysis and visualization tool designed for Java applications

kubernetes-sidecar-injector

73
Stars
33
Forks
Watchers

Kubernetes mutating webhook that injects a sidecar container into a pod

beeju

24
Stars
8
Forks
Watchers

JUnit integration for testing the Apache Hive Metastore and HiveServer2 Thrift APIs

apiary

35
Stars
8
Forks
Watchers

Apiary provides modules which can be combined to create a federated cloud data lake

apiary-data-lake

18
Stars
25
Forks
Watchers

Terraform scripts for deploying Apiary Data Lake

avro-compatibility

57
Stars
11
Forks
Watchers

A user-friendly API for checking for and reporting on Avro schema incompatibilities.

beekeeper

44
Stars
7
Forks
Watchers

Service for automatically managing and cleaning up unreferenced data

datasqueeze

18
Stars
7
Forks
Watchers

Hadoop utility to compact small files