yohplala
yohplala
(updated) Hi @martindurant , I was wondering if there is any possibility (or maybe it already exists?) to multithread the `write` function? I see some potential at 2 levels: -...
Hi @pyrito If i understand correctly, fastparquet already does that, maybe not straight away. ```python import fastparquet as fp import pandas as pd from os import path as os_path #...
PPS: if this answers your request, please, feel free to close the ticket
> I feel like this basic work flow could be achieved by dask. Of course, writing separate functionality is perfectly possible, but I wonder if it would end up as...
Yes, I think you wanted to react to #658 which is another ticket I wrote this morning.
> You could put it in a helper function with dedicated tests, to ensure that it behaves as expected with a range of inputs with or without the possibly redundant...
Thanks Martin for the feedback. > Of course, no other framework does this, so there is no template for it, and the optimisations you have in mind would have no...
PS: i rewrote fully the ticket. The story with sorted / not sorted column was an error. Sorry.
Hi Martin, I am not sure to understand your comment, but yes, I ought to have some free time available by end of year to propose implementations for some of...
Hi @martindurant Thanks for the feedback. At the moment, only reporting the bug, being focused on other topics ;). Bests,