Ma, Guokai
I read the initial description. From ds_report, it looks like @weiji14 is reporting a non-CUDA system: the ops listed in ds_report indicate that the CPU accelerator is selected. Is this intended? If running...
@whois206 the oneccl-devel package does not have Windows support. I checked the oneCCL product page https://www.intel.com/content/www/us/en/developer/tools/oneapi/oneccl-download.html, and there seems to be no Windows download package.
Hi @rsxdalv, you can override DeepSpeed device detection by setting the environment variable `DS_ACCELERATOR=cuda` before installing and running DeepSpeed. Besides, can you post the output of `pip list` in your environment? I...
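A minimal sketch of the override described above, assuming it is applied from Python instead of the shell. The helper name `env_with_accelerator` is hypothetical; only the `DS_ACCELERATOR` variable and the `cuda` value come from the comment.

```python
import os


def env_with_accelerator(name="cuda", base_env=None):
    """Return a copy of the environment with DS_ACCELERATOR set to `name`.

    `base_env` defaults to the current process environment. The original
    mapping is not modified.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["DS_ACCELERATOR"] = name
    return env


# Possible usage (assumption: the override must be in place for both steps):
#   subprocess.run([sys.executable, "-m", "pip", "install", "deepspeed"],
#                  env=env_with_accelerator("cuda"))
#   subprocess.run([sys.executable, "train.py"],   # hypothetical script name
#                  env=env_with_accelerator("cuda"))
```

Passing the environment explicitly to each subprocess keeps the override from leaking into unrelated processes.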
One suggestion is not to build ops at installation time and to let the op builder build them just in time. That way, only the essential ops will be built.
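The suggestion above could look like the following sketch. It assumes that installing with `DS_BUILD_OPS=0` skips pre-building ops and that `CPUAdamBuilder().load()` from `deepspeed.ops.op_builder` triggers the just-in-time build; both should be checked against the installed DeepSpeed version. `load_op_jit` is a hypothetical helper, not part of DeepSpeed.

```python
import importlib

# Assumed install step (shell), so that no ops are built at install time:
#   DS_BUILD_OPS=0 pip install deepspeed


def load_op_jit(builder_name="CPUAdamBuilder", module="deepspeed.ops.op_builder"):
    """Build and load a single op on demand.

    Returns the loaded op module, or None if the builder module cannot be
    imported (for example, when DeepSpeed is not installed).
    """
    try:
        op_builder = importlib.import_module(module)
    except ImportError:
        return None
    builder = getattr(op_builder, builder_name)()
    # Assumption: load() compiles the op on first use and caches the result.
    return builder.load()
```

Calling `load_op_jit()` once before training surfaces compiler errors for that single op early, without building the ops that are never used.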
I get the same error and noticed that cpu_adam.so didn't get built properly. In my case it seems to be a missing dependency. You can scroll back further to see what caused the...
> @delock that's not the case with me. I tried `DS_BUILD_CPU_ADAM=1 pip install deepspeed` and I got > > ``` > gcc -pthread -B /opt/shared/anaconda/2024.02/compiler_compat -DNDEBUG -fwrapv -O2 -Wall -fPIC...
> Hi @delock I followed your advice. The compilation is stuck at some point when building from source or when using pip. This is when using > > ```shell >...
This specific error indicates that gcc has encountered a serious error. I ran the same command on my system and I see gcc spend several seconds after the unroll warning...
> Thanks @delock I am using a controlled environment with VALE. I switch between GCC versions but usually I use 11.2.0. Other info: > > ``` > DeepSpeed general environment...
I have seen the build pause for several seconds, but it never got stuck. I saw that CPU Adam got some new features last month. Hi @BacharL, did you encounter anything abnormal during the build process...