Matthew Rocklin

1122 comments by Matthew Rocklin

There is already this: https://examples.dask.org/machine-learning/text-vectorization.html Maybe we can roll this into that somehow? On Mon, Jul 27, 2020 at 7:51 AM review-notebook-app[bot] wrote: > Check out this pull...

This seems like a sensible change to me. Would you be willing to add a test as well?

You can make tests that are 32/64-bit invariant. For example, you can test something like the following: assert nbytes([x, x, x]) == nbytes(x) * 3 + nbytes([]) Where x...
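A minimal sketch of the kind of relative-size test suggested above, under stated assumptions: the nbytes function below is a hypothetical stand-in written only for illustration (it is not the function under discussion in the thread), and it deliberately ignores per-element pointer slots. The point is that the assertion compares sizes against each other rather than against hard-coded byte counts, so it should not depend on the platform's word size.

```python
import sys


def nbytes(obj):
    """Hypothetical size estimate, for illustration only.

    Lists and tuples count the size of an empty container of the same
    type plus the estimated size of each element. Pointer slots are
    ignored on purpose; this is an assumption of the sketch.
    """
    if isinstance(obj, (list, tuple)):
        return sys.getsizeof(type(obj)()) + sum(nbytes(item) for item in obj)
    return sys.getsizeof(obj)


x = b"some payload"  # any object works here; the value is arbitrary

# No literal byte counts appear, so the check is the same on 32- and 64-bit builds.
assert nbytes([x, x, x]) == nbytes(x) * 3 + nbytes([])
```

A real size estimator may account for container overhead differently, in which case the right-hand side of the assertion would need to match that accounting.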

I think that for our purposes overestimates or approximations are fine. On Sat, Sep 30, 2017 at 4:52 PM, Pascal van Kooten wrote: > It's pretty difficult to write a...

Yeah, I'm more concerned with fixing the links pointing to old or stale documentation. I agree that many of these probably came from docs that referred to addresses generally.

Well, I think that CI would also be grand, if only to make us aware of failures as they arise due to external changes. I'll take what I can get...

Something publicly accessible would be great. We could probably ask the Pangeo group to host some modest amount of data easily. cc @scottyhq, who might be interested in this...

Are you already familiar with dask's distributed scheduler at http://distributed.readthedocs.org/en/latest/ ?

The distributed scheduler maintains distributed data indexed by key in the active memory of multiple nodes. It doesn't persist to disk but easily could by running some function against the...