apex
pipeline_parallel - ModuleNotFoundError: No module named 'amp_C'
Hi,
I was using apex and everything worked well until this week, when I started getting the following error:
---> 39 from apex import amp
---> 20 from . import transformer
----> 3 from apex.transformer import pipeline_parallel
----> 1 from apex.transformer.pipeline_parallel.schedules import get_forward_backward_func
----> 4 from apex.transformer.pipeline_parallel.utils import get_num_microbatches
---> 23 import amp_C
ModuleNotFoundError: No module named 'amp_C'
I saw that this error occurs in other scenarios (https://github.com/NVIDIA/apex/issues/865), but not in the same place. Do you know how to solve it? (PS: I'm running the code in Google Colab.) Thanks,
I have the same issue in pipeline_parallel, with the same stack trace and everything.
I am on the Python-only build of apex.
I have the same issue. The workaround that worked for me is to use the version prior to the last commit here: https://github.com/NVIDIA/apex/tree/3303b3e7174383312a3468ef390060c26e640cb1
As you can see in the commit history, pipeline_parallel was added on October 27th here: https://github.com/NVIDIA/apex/pull/1202. I don't know what the issue might be, but I hope the workaround works for you as well.
Yes this worked for me as well, thank you!
How do I use the prior version of apex? Are the install commands still the ones below?
git clone https://github.com/NVIDIA/apex
cd apex
pip install -v --disable-pip-version-check --no-cache-dir --global-option="--cpp_ext" --global-option="--cuda_ext" ./
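The commands quoted above do not pin a commit, so they would build whatever is currently at the head of the repository. A minimal sketch of one way to pin the checkout first, assuming the commit hash from the workaround earlier in this thread and the same install flags as in the question; the helper names `build_install_commands` and `run_commands` are hypothetical, and nothing here has been confirmed by the maintainers:

```python
import os
import subprocess
import sys

# Commit named in the workaround earlier in this thread.
PRIOR_COMMIT = "3303b3e7174383312a3468ef390060c26e640cb1"
REPO_URL = "https://github.com/NVIDIA/apex"


def build_install_commands(dest="apex", commit=PRIOR_COMMIT):
    """Return argv lists that clone apex, check out `commit`, and install it.

    Hypothetical helper: it only builds the commands and runs nothing.
    """
    return [
        ["git", "clone", REPO_URL, dest],
        # `git -C <dir>` runs the command as if started inside <dir>.
        ["git", "-C", dest, "checkout", commit],
        [
            sys.executable, "-m", "pip", "install", "-v",
            "--disable-pip-version-check", "--no-cache-dir",
            "--global-option=--cpp_ext", "--global-option=--cuda_ext",
            # Absolute path so pip treats it as a local directory,
            # not as a package name to download.
            os.path.abspath(dest),
        ],
    ]


def run_commands(commands):
    """Run each command in order and stop at the first failure."""
    for argv in commands:
        subprocess.run(argv, check=True)
```

Calling `run_commands(build_install_commands())` would perform the clone, checkout, and build in sequence; in Colab the same three commands can be run by hand in a cell instead.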
I didn't try it. In my case I simply added the apex directory to the system path using the commands below and it worked for me.
import sys
sys.path.append("/workspace/apex")
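A slightly more defensive version of the same idea, as a sketch only: it checks that the directory exists and avoids adding it twice. The path "/workspace/apex" is taken from the comment above and is presumably where apex was cloned on that machine; the helper name `add_checkout_to_path` is hypothetical.

```python
import os
import sys


def add_checkout_to_path(checkout_dir):
    """Append `checkout_dir` to sys.path if it exists and is not already listed.

    Hypothetical helper; returns the absolute path that was checked.
    """
    checkout_dir = os.path.abspath(checkout_dir)
    if not os.path.isdir(checkout_dir):
        raise FileNotFoundError(f"No such directory: {checkout_dir}")
    if checkout_dir not in sys.path:
        sys.path.append(checkout_dir)
    return checkout_dir
```

Usage would be `add_checkout_to_path("/workspace/apex")`, with the path replaced by wherever the repository was cloned. Because the entry is appended rather than inserted at the front, an apex that is already installed in the environment would still be found first.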
Sorry for the trouble. I wrote a patch, #1211. Could some of you try https://github.com/NVIDIA/apex/commit/25bfcb914796c7955956fc1e39839e7efed997d5 and let me know whether the issue is solved?
not solved for me
the problem doesn't seem to be solved yet
Not solved for me either
not solved either
Not solved either
To check whether "amp_C" is really installed, add a print statement in setup.py around the place that adds the "amp_C" targets. I've seen that sometimes the pip install ... command does NOT trigger these targets, and I have to use a command of the form python setup.py install --cuda_ext instead.
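A complementary check that does not require editing setup.py is to ask Python whether it can locate the module at all. This is a sketch using the standard library's `importlib.util.find_spec`; the helper name `extension_available` is hypothetical, and a False result only says the module is not importable from the current environment, not why the build skipped it.

```python
import importlib.util


def extension_available(name="amp_C"):
    """Return True if a top-level module called `name` can be located for import."""
    try:
        return importlib.util.find_spec(name) is not None
    except (ImportError, ValueError):
        # find_spec can raise for some unusual module states; treat as missing.
        return False


# Report whether the compiled extension from this thread is visible.
print("amp_C available:", extension_available("amp_C"))
```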
Really Awesome!
@lunasara, could you explain what "/workspace/apex" is? Thanks in advance!
Hello, your email has been received.