Russell Aronson
If the cluster is under heavy load, client calls that interact with it directly can fail in unexpected ways. I have seen `LeaseExpiredException`, `ConnectException: Connection refused`, and others...
If both data sets are stored sorted on the join key, then it's possible to perform the join on the map side. The general idea is to: - Build up...
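The summary above is cut off, so the following is only a minimal sketch of the general technique (a sort-merge join over two inputs already sorted on the join key), not the approach the issue goes on to describe. The names `merge_join` and `_runs` are made up for illustration, and plain in-memory lists stand in for the sorted data sets.

```python
from itertools import groupby
from operator import itemgetter


def _runs(pairs):
    """Yield (key, [values]) for each run of equal keys in key-sorted input."""
    for key, group in groupby(pairs, key=itemgetter(0)):
        yield key, [value for _, value in group]


def merge_join(left, right):
    """Inner-join two iterables of (key, value) pairs.

    Assumes both inputs are sorted ascending on the key. Each input is
    read once, front to back, so no shuffle or re-sort is needed.
    """
    left_runs, right_runs = _runs(left), _runs(right)
    l, r = next(left_runs, None), next(right_runs, None)
    while l is not None and r is not None:
        if l[0] < r[0]:
            l = next(left_runs, None)    # left key has no match yet; advance left
        elif l[0] > r[0]:
            r = next(right_runs, None)   # right key has no match yet; advance right
        else:
            # Keys match: emit the cross product of the two runs.
            for lv in l[1]:
                for rv in r[1]:
                    yield l[0], lv, rv
            l, r = next(left_runs, None), next(right_runs, None)


left = [(1, "a"), (2, "b"), (2, "c"), (4, "d")]
right = [(2, "x"), (3, "y"), (4, "z"), (4, "w")]
joined = list(merge_join(left, right))
```

Only one run of equal keys per side is held in memory at a time, which is what makes this workable inside a mapper when the inputs are pre-sorted.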
We have hit performance issues when performing relational joins because of key skew. Pig and Hive have implemented optimizations for skewed joins, and it would be nice for something...
Avro supports type unions, which means it's possible to have multiple types for a datum. One use that comes to mind is supporting optional values, e.g. Option[AvroType] would have a...
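As a rough illustration of the optional-value idea, an Avro union of `"null"` and another type is the usual way to express a nullable field in a schema. The sketch below only builds the schema JSON; the helper `optional` and the record and field names are hypothetical, and it does not show how any particular library would map `Option` onto it.

```python
import json


def optional(inner_schema):
    """Hypothetical helper: wrap an Avro schema in a union with "null"."""
    return ["null", inner_schema]


# Illustrative record with one required and one optional field.
user_schema = {
    "type": "record",
    "name": "User",
    "fields": [
        {"name": "id", "type": "long"},
        # "null" is listed first so the default value can be null.
        {"name": "email", "type": optional("string"), "default": None},
    ],
}

schema_json = json.dumps(user_schema, indent=2)
```

Under this scheme an absent value would be written as the `null` branch of the union and a present value as the `string` branch.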