Russell Aronson

Results 4 issues of Russell Aronson

If the cluster is under a lot of load, client calls which directly interact with it can fail in weird ways. I have seen `LeaseExpiredException`, `ConnectException: Connection refused`, and others...

If both data sets are stored sorted on the join key, then its possible to perform the join on the map side. The general idea is to: - Build up...

New Feature

We have hit some performance issues when performing relational joins because of key skew. Pig and hive have implemented optimizations for skewed joins, and it would be nice for something...

New Feature

Avro supports type unions, which means its possible to have multiple types for a datum. One use that comes to mind is supporting optional values, eg. Option[AvroType] would have a...

Improvement