Guido Imperiale
If I understood correctly:

1. x is in memory on a
2. a receives the stimulus to compute y, which depends on x. y transitions to executing.
3. before y...
Actually, I don't think you need to change the graph. You can get this error if you resubmit the exact same tasks.

1. x is in memory on a...
Found a nice textbook example in the distributed tests https://github.com/dask/distributed/pull/8185/files#r1479864806
(repost from https://github.com/dask/distributed/pull/8185#issuecomment-1930341114)

Found another use case:

```python
ddf = dd.from_pandas(df, npartitions=4)
with dask.config.set({"dataframe.shuffle.method": "p2p"}):
    ddf = ddf.set_index("a").sort_values("b")
    result = ddf.compute()
```

This code:

1. sends some keys to the...
lock=True works exclusively with the threaded scheduler. It will work with neither multiprocessing nor distributed. At the very least, the documentation of da.store should clarify that. It looks like you...
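One plausible reason a lock tied to the threaded scheduler cannot carry over to multiprocessing or distributed is that an ordinary in-process lock cannot be serialized and sent to another process. The sketch below uses only the standard library and assumes, without checking the dask source, that `lock=True` resolves to something like a plain `threading.Lock`; `can_cross_process_boundary` is a hypothetical helper name.

```python
import pickle
import threading


def can_cross_process_boundary(obj) -> bool:
    """Return True if obj can be pickled, i.e. shipped to another process."""
    try:
        pickle.dumps(obj)
    except (TypeError, pickle.PicklingError):
        return False
    return True


# Assumption: this stands in for whatever lock object lock=True creates.
in_process_lock = threading.Lock()

# A plain threading lock lives in one process only and refuses to be pickled.
lock_is_portable = can_cross_process_boundary(in_process_lock)
```

Even a lock that did pickle would only protect writers if every worker ended up sharing the same underlying lock, which an in-process primitive cannot provide.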
Re-opening as the current implementation of set always writes to dask.yaml, whereas the ticket correctly points out that it would be better to write to the correct config file (distributed.yaml...
> I was under the impression that opening a file for writing will lock that file Maybe on NFS? Definitely there are no locks on local file system on Linux...
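The point about missing locks can be checked with a small standard-library sketch. It assumes a local file system on Linux, and `open_twice_for_writing` is a hypothetical helper name: the same file is opened for writing through two handles at once, which would fail or block if opening for writing implied a lock.

```python
import os
import tempfile


def open_twice_for_writing() -> bool:
    """Return True if one file can be open for writing through two handles at once."""
    fd, path = tempfile.mkstemp()
    os.close(fd)
    try:
        with open(path, "w") as first:
            first.write("first\n")
            # If open-for-write took a lock, this second open would fail or block.
            with open(path, "a") as second:
                second.write("second\n")
        return True
    except OSError:
        return False
    finally:
        os.remove(path)


both_handles_opened = open_twice_for_writing()
```

Any mutual exclusion between writers therefore has to be requested explicitly, for example with an advisory lock, rather than assumed from the open call.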
WHAM! Self-contained POC that falls over after less than 30 seconds 🥇

```python
import importlib
import os
import shutil
from concurrent.futures import ProcessPoolExecutor
from textwrap import dedent

os.chdir(os.path.abspath(os.path.dirname(__file__)))


def f(i,...
```
Hello, any update on this?
Hi, any info on when this will be available in master? Thanks