Dinesh Dutt

Results 52 comments of Dinesh Dutt

Blocked for the moment by the underlying framework limitation: https://github.com/facebookincubator/python-nubia/issues/14. Need to figure out more nuanced response.

pl.read_parquet doesn't seem to work with just a dir name even if pyarrow is installed for me. ``` In [5]: pl.read_parquet('../suzieq/play/donna/new-parquet/routes/', columns=['namespace', 'hostname', 'vrf', 'prefix']) --------------------------------------------------------------------------- RuntimeError Traceback (most recent...

Thanks @ritchie46 though that works, I can't stitch together key information that is present in the directory partition

Say, I have a parquet folder structured in hive format as follows: /parquet/namespace=foo/. Under this namespace folder, there are lots of folders with the name format hostname=(maybe a thousand) and...

One clunky way to make this work is to manually walk namespace and hostname folder and for each hostname folder manually add the namespace and hostname columns.

And a different error I get when I don't specify any columns (I want them all) is I get this error: ``` Any(ArrowError(NotYetImplemented("Decoding \"RleDictionary\"-encoded, dictionary-encoded optional V1 pages is not...

This is a fairly critical bug in my use case too, any idea when this'll be fixed?

Hi Ron, in our case, if we use just those strings, we terminate anytime any one of those characters is present in the output which is not uncommon. now regexs...

Thanks for the detailed info Ron.

> Can this be merged? Yes, we'll work on getting this done for a possible end-of-month release