Brandyn White
Brandyn White
I apologize for the cross post but this is how I fixed this problem in Hadoopy http://bwhite.github.com/hadoopy/#pipe-hopping-using-stdout-stderr-in-hadoopy-jobs
I switched my python strings to typecode 0 which will still keep compatibility but I haven't forced typecode 7 to be utf-8 (not automatically decoded) which could raise errors.
Do you recall what operations were slow using unicode? Was it decoding utf-8 or general string operations?
I would help with this if there is interest. The purpose of Hadoopy isn't to recreate this functionality, it is to create a thin core python interface for streaming. I...