Logan Adams

Results 294 comments of Logan Adams

Hi @SkyAndCloud - any updates on this?

Interesting, I'm curious if this repros with the latest deepspeed as well, but since I'm not able to repro this and your code is running successfully I'll close it for...

Hi @paveltonev - your issue is probably more similar to this one? https://github.com/microsoft/DeepSpeed/issues/3794 Given the fact that it is an error and not a warning, and that you are using...

@KeeratKG - the error is here: ``` AssertionError: CUDA_HOME does not exist, unable to compile CUDA op(s) ``` It seems you'd need to set your CUDA_HOME env var. However, this...

If you are using conda, we have an environment.yaml [here ](https://github.com/microsoft/DeepSpeed/blob/master/environment.yml)that you can use and has worked for others. But to debug, I would try the following: ``` nvcc --version...

@amitportnoy and @susht3 - I'm not sure I understand what your issue is, could you elaborate? [Here ](https://github.com/microsoft/DeepSpeed/blob/78c3b148a8a8b6e60ab77a5c75849961f52b143d/.github/workflows/nv-torch110-v100.yml#L25C3-L30C1)is the sample of how we install a specific torch/cuda version in our...

Hi @kiddyboots216 - the warning is thrown from [here ](https://github.com/microsoft/DeepSpeed/blob/78c3b148a8a8b6e60ab77a5c75849961f52b143d/op_builder/builder.py#L341)which is in the op_builder. Can you try, when you install DeepSpeed, running `DS_BUILD_CPU_ADAM=1 pip install deepspeed` so the ops will...

Interesting, thanks for the info @kiddyboots216 - could you share how much memory you needed?

@wuhongyan123 - I'm not sure that is the full error, but it looks like something is wrong with your setup, since these errors aren't related to python/DeepSpeed, I'd check that...

Running CI here: https://github.com/microsoft/DeepSpeed/actions/runs/8474231174