ColossalAI-Examples icon indicating copy to clipboard operation
ColossalAI-Examples copied to clipboard

ModuleNotFoundError: No module named 'colossal_layer_norm_cuda'

Open 200987299 opened this issue 2 years ago • 3 comments

Dear developers,

I am trying to run the bert example but I got this error, any hint to fix it?

Thanks

200987299 avatar Apr 24 '22 09:04 200987299

Hi, this is because you CUDA extension is not built. Can you make sure your PyTorch uses the same cuda version as your cuda runtime. One way is to add the -v flag to your pip install command. If cuda extension is not built, the log will tell you the info about the version mismatch.

FrankLeeeee avatar Apr 24 '22 09:04 FrankLeeeee

Hi, this is because you CUDA extension is not built. Can you make sure your PyTorch uses the same cuda version as your cuda runtime. One way is to add the -v flag to your pip install command. If cuda extension is not built, the log will tell you the info about the version mismatch.

Hi, could you be more specific? I tried few times reinstalling all the stuff, but I failed every time. The pip -v install shown everything was fine.

Thanks

200987299 avatar Apr 27 '22 08:04 200987299

Hi @200987299 We have updated and simplified the installation process, and the example code has been updated. You can try to reinstall, thanks.

binmakeswell avatar Jun 21 '22 09:06 binmakeswell