hdfs3
add crc=True|False parameter to HDFileSystem(...)
Title says it all. I also added instructions for testing on Python 2.7 to the CI README. The test is a bit long-winded; comments and improvements are welcome.
I am stumped! I have no idea why the compression should matter, since the values encoded in the path by `partition_on` are not compressed at all. That it might depend on the type of value is not as surprising, since fastparquet attempts to convert the (string) values encoded in the path back into whatever the original pandas type was, and so for `in`, the types would need to match to pass the filter. You can check what was inferred with:
pf = fastparquet.ParquetFile(..)
pf.cats
See the function filter_out_cats for how the values get used for comparison.
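To illustrate why the types matter for an `in` filter, here is a minimal pure-Python sketch. It is not fastparquet's actual code: `passes_in_filter` is a hypothetical helper, and the year values are made up for the example. It only shows that a membership test stops matching once the partition value has been converted from a string to an integer while the filter values are still strings.

```python
def passes_in_filter(partition_value, allowed):
    """Hypothetical helper: True if the partition value is one of the allowed values."""
    return partition_value in set(allowed)


# The value as it appears in the directory name is always a string.
raw = "2017"
# Assume the value was converted back to an integer when the file was opened.
converted = int(raw)

# Same types on both sides: the partition passes the filter.
print(passes_in_filter(converted, [2017, 2018]))      # True
# Integer partition value against string filter values: nothing matches.
print(passes_in_filter(converted, ["2017", "2018"]))  # False
```

So if the inferred values turn out to be integers, the filter values would presumably need to be integers as well.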
Hi @martindurant, thanks for the feedback. At the moment I am only reporting the bug, as I am focused on other topics ;). Best,