Gary
Gary
That does seem the safe approach. What about for minSplitSizeNode and minSplitSizeRack? I'd think to extract from conf using CFIF config keys, then set them into EB's SplitUtil conf keys...
I've created a pull request: https://github.com/kevinweil/elephant-bird/pull/420
It looks like the build is failing when using -P hadoop-old due to: [ERROR] Failed to execute goal on project hadoop-lzo: Could not resolve dependencies for project com.hadoop.gplcompression:hadoop-lzo:jar:0.4.20-SNAPSHOT: Could not...
I've added a configuration option for what size of a file should be considered "small." By default it is Long.MIN_VALUE, which should preserve current behavior if it is not specified....
@gerashegalov @sjlee Thoughts?
@dvryaboy It looks like a previous pull request https://github.com/twitter/hadoop-lzo/pull/82 did something similar, but was also never merged. It's possible the change you're talking about is in elephantbird instead of hadoop-lzo,...