Raghu Angadi
Raghu Angadi
If an application does not use use lzo, we should not require hadoop-lzo.jar or lzo native libs. There are probably some references that could be organized better.
The index for a an lzo file is read on the client while making the splits. for large inputs, this takes very long since the file are read serially. Some...
The default thrift deserializer is very lenient and ignores anything that does not quite make sense. The consumers almost always prefer Thrift deserializer to fail when a serialized record has...
It is annoying for users to see the following warnings multiple times in a script. We should either provide an option not to warn or limit it o once per...
Currently ThriftToPig does not handle some cases well because of lack of type info inside containers. e.g. : - if a struct as exception inside a container (as in map),...