Logan Adams
Logan Adams
@laladong - this is an all reduce, what more/specifically were you looking for?
Hi @oelayan7 - thanks for the contribution, could you take a look at the failing tests?
Hi @1148514800 - do you have nvcc installed? Are you able to run `which nvcc` or `nvcc --version`
Thanks @1148514800 - that's good, I'm glad the issue is resolved and sorry for the delay in replying. I'll close the issue now but if you hit any others, please...
Hi @pacman100 - thanks for making this issue here to better track it. Does this also happen with the latest changes in the master branch?
> I found a workaround. Just manually patching your `runtime/zero/stage3.py` according to [PR 5461](https://github.com/microsoft/DeepSpeed/pull/5461) will fix everything. @lihe07 - so using the latest deepspeed built from source works? You don't...
Hi @MinFFFF - can you please share the full error that you see when you run `DS_BUILD_FUSED_ADAM=1 pip install deepspeed`? As well as information about your system/accelerator.
Hi @daehuikim - are you able to run `pip install deepspeed` with no errors? And do you hit any errors when installing other ops? It appears that your system is...
I see, is there a reason that you need to precompile the ops? Since you should be able to run DeepSpeed on the GPU nodes and it will detect the...
Thanks @daehuikim - that makes sense, since it currently believes your environment is a CPU environment on your master node, so it believes that it can only run certain ops...