Coupled-Iterative-Refinement icon indicating copy to clipboard operation
Coupled-Iterative-Refinement copied to clipboard

RuntimeError: CUDA error: no kernel image is available for execution on the device

Open kooshyarkosari opened this issue 2 years ago • 9 comments

Dear Author Thank you very much for your excellent and amazing work: I tried to replicate the demo file but got a flowing error.

Screenshot (78)

configuration is :

1)Nvidia / cuda11.30-deve-ubuntu 20.94 (docker container) 2) torch-1.8.0+cu111

Screenshot (80)

kooshyarkosari avatar Jan 07 '23 09:01 kooshyarkosari

Did you manage to resolve the issue? If not, have you tried reinstalling the conda environment? I've seen this message when reusing pre-built binaries after I've updated my cuda version.

lahavlipson avatar Jan 16 '23 21:01 lahavlipson

Hi,

not yet unfurtentelly. I have tried different ubuntu versions as well as the Cuda version, moreover, I have reinstalled conda environment but the issue still remained

kooshyarkosari avatar Jan 18 '23 07:01 kooshyarkosari

I was able to reproduce the problem on a Tesla K80, but I wasn't able to find a solution unfortunately.

It looks like lietorch needs pytorch>=1.7, but this pytorch version can cause the aforementioned issue on this particular graphics card.

This issue doesn't seem to happen on a GTX-1080 or any newer cards. I'll keep looking and update this thread if I find a solution.

lahavlipson avatar Jan 23 '23 02:01 lahavlipson

ok thank you very much

kooshyarkosari avatar Jan 23 '23 07:01 kooshyarkosari

do you have a solution?

dudulry avatar Mar 09 '23 04:03 dudulry

not yet unfurtentelly.

kooshyarkosari avatar Mar 09 '23 07:03 kooshyarkosari

not yet unfurtentelly.

Have you used APEX?I encountered this error after using APEX. I fixed it by creating a new environment with RTX3090, CUDA 11.6, Python 3.7, and Torch 1.12.now it can work.

dudulry avatar Mar 10 '23 03:03 dudulry

no , since I have just access to Tesla K80 GPU

kooshyarkosari avatar Mar 10 '23 07:03 kooshyarkosari

Excuse me, has anyone encountered this problem?

(cir) bimlab@bimlab-server:~/pporzz/Coupled-Iterative-Refinement$ python demo.py --obj_models lmo --scene_dir /home/bimlab/pporzz/Coupled-Iterative-Refinement --load_weights model_weights/refiner/ycbv_rgbd.pth

/home/bimlab/miniconda3/envs/cir/lib/python3.8/site-packages/torchvision/io/image.py:13: UserWarning: Failed to load image Python extension: '/home/bimlab/miniconda3/envs/cir/lib/python3.8/site-packages/torchvision/image.so: undefined symbol: _ZN3c104warnERKNS_7WarningE'If you don't plan on using image functionality from torchvision.io, you can ignore this warning. Otherwise, there might be something wrong with your environment. Did you have libjpeg or libpng installed before building torchvision from source? warn( terminate called after throwing an instance of 'std::bad_alloc' what(): std::bad_alloc Aborted (core dumped)

lin-fangzhou avatar May 08 '23 07:05 lin-fangzhou