Martin Durant

Results: 2772 comments by Martin Durant

I guess I'll put it somewhere on my list, but I don't anticipate anything immediate.

@datapythonista, I have implemented this here: https://github.com/martindurant/mnist-data-intake. Note that this requires the latest master version of Intake (because the path munging while decompressing has a bug), so the package...

Yes, I considered that, and initially started the work as a PR, but made a separate repo because the main aim of your repo is installable code to be executed...

Hm, I have run it many times locally and it always passes, which makes it annoying! Could it be something like the metadata file not having made it to disk yet?

I think it may be that pyarrow is taking the index from global metadata and fastparquet is taking it from the individual files, allowing for some randomisation. I am not...

Related code suggestion in https://github.com/dask/dask/issues/6074

[here](https://github.com/fsspec/filesystem_spec/blob/master/fsspec/core.py#L675)? Yes, it's clearly something that could be optional and would make sense to surface. Seems like `name_function` was indeed meant for this - but doesn't fit the bill....

I don't believe that your asyncio event loop (`_loop`) is ever started. You would want to run that on its own thread, or run the entire application asynchronously with `asyncio.run()`.

Yes, you would think that you can pass any type compatible with that MemoryFile interface, where methods to return sets of bytes from the target were implemented to read on...