cyavro
Cython-based wrapper for libavro
Looks as though issue #9 may have popped up again in cyavro version `0.7.0`? ``` /opt/miniconda/envs/combine/lib/python3.5/site-packages/cyavro/__init__.py in (.0) 499 elif issubclass(tp, dict): 500 # get value element type --> 501...
I just started using cyavro today, and it's wonderful so far. It precisely fills a need to parse a directory of Avro files -- quickly -- into a pandas DataFrame....
Since Avro 1.8.0, support for datetime has been added to the format. We should support reading that.
In order to allow parallel reading by blocks, I would prefer the class to read the header on instantiation and expose the schema and sync marker. This...
The current method is to create numpy arrays (or lists) for a given chunk of a given block. The creation of a pandas dataframe from numpy arrays, and the concatenation...
If the C variables are global, the reader will not be thread-safe. If, however, everything is held in instance variables, we should be OK. This needs investigation.
thanks @mariusvniekerk