PreSumm icon indicating copy to clipboard operation
PreSumm copied to clipboard

CUDA runtime error on Google Colab

Open NandaKishoreJoshi opened this issue 4 years ago • 2 comments

I tried running the below test command on google colab

python train.py -task abs -mode test -test_from ../models/model_step_148000.pt -batch_size 3000 -test_batch_size 500 -bert_data_path ../bert_data/ -log_file ../logs/val_abs_bert_cnndm -model_path ../models/ -sep_optim true -use_interval true -visible_gpus 1 -max_pos 512 -max_length 200 -alpha 0.95 -min_length 50 -result_path ../logs/abs_bert_cnndm

This gives the error saying that CUDA device is not found. I tried to debug the train.py and found that the code assigns device_id=0 for visible_gus=0,1,2,3. Should we change something here to make it work? This issue is same as closed issue #140 but there is no resolution mentioned. Any help would be much appreciated

NandaKishoreJoshi avatar Jun 12 '20 14:06 NandaKishoreJoshi

Late reply. But here is the solution -visible_gpus = 0

SebastianVeile avatar Jul 08 '20 13:07 SebastianVeile

I tried running the below test command on google colab

python train.py -task abs -mode test -test_from ../models/model_step_148000.pt -batch_size 3000 -test_batch_size 500 -bert_data_path ../bert_data/ -log_file ../logs/val_abs_bert_cnndm -model_path ../models/ -sep_optim true -use_interval true -visible_gpus 1 -max_pos 512 -max_length 200 -alpha 0.95 -min_length 50 -result_path ../logs/abs_bert_cnndm

This gives the error saying that CUDA device is not found. I tried to debug the train.py and found that the code assigns device_id=0 for visible_gus=0,1,2,3. Should we change something here to make it work? This issue is same as closed issue #140 but there is no resolution mentioned. Any help would be much appreciated

Untitled Hello NandaKishoreJoshi, I got an issue when I tried put code on Google Colab. How did you handle this issue?

congdoanit98 avatar Oct 09 '20 01:10 congdoanit98