Benjamin Zaitlen
We've run into permission issues in the past, though MIG setups might require something else. Possible solutions are documented here: https://github.com/gpuopenanalytics/pynvml#nvml-permissions
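For reference, a minimal sketch (assuming `pynvml` is installed) of checking whether NVML calls succeed under the current user's permissions; it does not cover anything MIG-specific:

```python
import pynvml

try:
    pynvml.nvmlInit()
    # enumerate devices; a permission problem typically surfaces here or in nvmlInit()
    for i in range(pynvml.nvmlDeviceGetCount()):
        handle = pynvml.nvmlDeviceGetHandleByIndex(i)
        print(i, pynvml.nvmlDeviceGetName(handle))
except pynvml.NVMLError as e:
    print("NVML initialization/permission problem:", e)
```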
If we had the `--no-nanny` option, at least for the CLI, users could then build workers manually:

```
CUDA_VISIBLE_DEVICES=0 UCX_OPTION_FOO dask-cuda-worker ucx:// --no-nanny
CUDA_VISIBLE_DEVICES=1 UCX_OPTION_FOO dask-cuda-worker ucx:// --no-nanny
```
Thanks for following up @charlesbluca
I think it's a bit of work but definitely doable!

1. Before getting to cuda-workers, I _think_ we first need to resolve starting the scheduler in the cluster. Right now...
Dask on MPI systems is used quite a bit more than expected. You might be interested in the [dask-mpi](https://mpi.dask.org/en/latest/howitworks.html) project. @jacobtomlinson have you used `dask-mpi` to start a...
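For context, a minimal sketch of how `dask-mpi` can bootstrap a cluster from within an MPI job (assuming `dask_mpi` and `distributed` are installed and the script is launched under `mpirun`/`srun`):

```python
# sketch.py -- run with e.g. `mpirun -np 4 python sketch.py`
from dask_mpi import initialize
from distributed import Client

# rank 0 becomes the scheduler, one rank runs this client code,
# and the remaining ranks become workers
initialize()

client = Client()  # connects to the scheduler started by initialize()
print(client.submit(sum, range(10)).result())
```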
I think having some of these defaults makes sense. In dask core `--memory-limit` is set to `auto` by [default](https://distributed.dask.org/en/latest/worker.html).

```python
In [1]: import pynvml

In [2]: pynvml.nvmlInit()

In ...
```
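As an illustration (not the actual dask-cuda implementation), a hedged sketch of how an `auto`-style device-memory default could be derived from pynvml; the 0.8 fraction here is purely an example:

```python
import pynvml

pynvml.nvmlInit()
handle = pynvml.nvmlDeviceGetHandleByIndex(0)
total_bytes = pynvml.nvmlDeviceGetMemoryInfo(handle).total
# leave some headroom, analogous in spirit to --memory-limit auto
device_memory_limit = int(total_bytes * 0.8)
print(device_memory_limit)
```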
@ntabris are you able to move forward?
Still an issue
The context manager statement is incorrect:

> async with await LocalCUDACluster

should be

> async with LocalCUDACluster
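A minimal sketch of the corrected async usage, assuming `dask_cuda` and `distributed` are installed:

```python
import asyncio

from dask_cuda import LocalCUDACluster
from distributed import Client

async def main():
    # no `await` before LocalCUDACluster when used as an async context manager
    async with LocalCUDACluster(asynchronous=True) as cluster:
        async with Client(cluster, asynchronous=True) as client:
            future = client.submit(sum, [1, 2, 3])
            print(await future)

asyncio.run(main())
```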
Can you describe your use case a bit more, and why you might need both blazingsql and an additional async client?