Florian Jetter
Thanks for your report. Before I get into this, what you call "compilation", i.e. the process of building the "low level graph" / initializing all tasks, is what we typically...
Apart from the materialization issues, your example is generating a graph of about 4 million tasks. This is larger than what we typically deal with, and especially if you are...
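For scale, here is a rough sketch of how to inspect the size of the low-level graph before computing anything; the shape and chunking below are hypothetical stand-ins chosen to land at roughly 4 million tasks, not taken from the original example:

```python
import dask.array as da

# Hypothetical shape/chunks chosen only to produce a ~4 million task
# graph; the original example's dimensions are not reproduced here.
x = da.ones((200_000, 200_000), chunks=(100, 100))

print(x.numblocks)              # (2000, 2000) -> 4_000_000 blocks
print(len(x.__dask_graph__()))  # one task per block in the low-level graph
```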
> layers, it should be the case that in the example above, the first block can be computed without actually materializing the other (unnecessary for that block) parts of the...
@rjzamora maybe you have some insights here?
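If culling by high-level layers works the way the quoted comment hopes, something along these lines should only ever walk the tasks the requested block depends on (a sketch, reusing the array `x` from the snippet above; whether materialization is actually avoided is exactly the open question here):

```python
# Selecting a single block before computing lets the optimizer cull the
# graph down to that block's dependencies, so ideally the other ~4 million
# tasks never need to be materialized or scheduled.
first_block = x.blocks[0, 0].compute()
```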
I actually don't have a very strong opinion here. I'm mildly concerned about confusion internally and about migration efforts (for users). If we believe this is worth it, I'm fine...
Sorry for the late reply. I can reproduce this. Here is another reproducer (note that `jit` works):

```python
from numba import guvectorize, int64, jit

@jit(nopython=True, cache=True)
def f(x, y):
    return ...
```
Sorry, I believe the error I am reproducing with my example is actually a little different. My example only fails if the `cache` kwarg is `True`. The dask-examples version fails...
I think this is somewhat expected. The object `mapper` you are defining requires about 250kB in memory. When calling `map_blocks` this will generate _tasks_ internally that embed whatever arguments you're...
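A rough way to see this effect, using a stand-in `mapper` and a trivial mapped function (both hypothetical, not from the original report): serializing each task separately, roughly the way the distributed scheduler ships them, shows that every task carries its own copy of the object:

```python
import pickle
import numpy as np
import dask.array as da

mapper = np.random.random(31_250)               # ~250 kB, stand-in object
x = da.ones((1_000, 1_000), chunks=(100, 100))  # 100 blocks

def add_mean(block, m):
    return block + m.mean()

# `mapper` is a plain numpy array, so it gets embedded into every task
y = x.map_blocks(add_mean, mapper, dtype=x.dtype)

# Pickling the materialized tasks one by one: ~100 * 250 kB ≈ 25 MB
graph = dict(y.__dask_graph__())
print(sum(len(pickle.dumps(task)) for task in graph.values()))
```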
Typically it is expected that a task's definition is small, and we retain the task until the computation is completed. Specifically, it is retained for as long as a local...
In these situations we recommend one of two things:

1. Ideally, whatever data is in `mapper` can either be generated inside of a dask task or loaded from a remote...
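A minimal sketch of that first option, with a hypothetical `build_mapper` standing in for however `mapper` is actually constructed or loaded from remote storage:

```python
import functools
import numpy as np
import dask.array as da

x = da.ones((1_000, 1_000), chunks=(100, 100))

@functools.lru_cache(maxsize=1)
def build_mapper():
    # Hypothetical: construct (or fetch from remote storage) the ~250 kB
    # object here, inside the task, so it never has to be embedded in the
    # graph. The cache means each worker process builds it only once.
    return np.random.default_rng(0).random(31_250)

def use_mapper(block):
    return block + build_mapper().mean()

y = x.map_blocks(use_mapper, dtype=x.dtype)
y.compute()
```

The tasks now only reference `use_mapper` by name, so the graph stays small no matter how large the object is.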