Plume
Plume copied to clipboard
Explorations relative to cloning FlumeJava
I would like to be able to specify a two writable classes to receive the contents of a sequence file. This is similar to the way that Avro naturally reads...
This would be the meaning of the records() type specifier. The schema would come from the input file and the values would be generic avro objects that would need to...
We should be able to execute map-reduce programs very efficiently on a local machine using threads and the optimizer.
The natural set would be something like Pigmix or Gridmix2. I would tend to add k-means to the list of algorithms. The goal would be to measure completion of the...