Results: 20 repositories owned by Nathan Marz

basic-specter

Stars: 23, Forks: 1

Implementation of the core of Specter without any optimizations – a reference to understand the basics of how Specter works

cascading-batch-query

Stars: 21, Forks: 3

Optimized joins using bloom filters on Hadoop via Cascading.

cascalog

Stars: 1.4k, Forks: 181

Data processing on Hadoop without the hassle.

cascalog-contrib

Stars: 47, Forks: 17

cascalog-demo

Stars: 27, Forks: 10

A short Cascalog program that produces a simplified version of a Facebook-like news feed.

cascalog-workshop

Stars: 18, Forks: 4

Materials for Cascalog workshop

dfs-datastores

Stars: 215, Forks: 82

Dead-simple vertical partitioning, compression, appends, and consolidation of data on a distributed filesystem.

elephantdb

Stars: 553, Forks: 56

Distributed database specialized in exporting key/value data from Hadoop

elephantdb-cascalog

Stars: 18, Forks: 2

Seamless integration of ElephantDB with Cascalog

kafka-deploy

Stars: 123, Forks: 30

Automated deploy for Kafka on AWS