
Multi-GPU training

Open hamidpalangi opened this issue 4 years ago • 7 comments

Hi Kevin,

Thanks for the great work and for releasing the code and models. Have you tried multi-GPU training for ELECTRA-base and ELECTRA-large (does the current code support multi-GPU)? If so, do you have stats for the multi-GPU experiments?

Also, do you have stats for single-GPU training of ELECTRA-base and ELECTRA-large (how many days are needed until they converge to decent performance)?

Thanks! -Hamid

hamidpalangi avatar Mar 13 '20 00:03 hamidpalangi

I've tried the current starter code for pretraining a small network. It seems the model is trained on a single GPU only.

zxybazh avatar Mar 13 '20 08:03 zxybazh

Yes, it works on a single GPU. Looking for multi-GPU support. @clarkkev

008karan avatar Mar 17 '20 07:03 008karan

Gentle follow-up, Kevin: any thoughts?

Thanks, -Hamid

hamidpalangi avatar Mar 23 '20 18:03 hamidpalangi

Are there any plans for multi-GPU support, @clarkkev?

008karan avatar May 13 '20 16:05 008karan

Hoping for multi-GPU support and a detailed configuration guide. @clarkkev

MarkClemens301 avatar May 21 '20 15:05 MarkClemens301

@008karan @Palang2014 How are you getting the model to run on a GPU? Even with a GPU available, I'm only able to run on CPU. Mostly interested in running the fine-tuning, not the pre-training.

sadhikamalladi avatar Sep 13 '20 15:09 sadhikamalladi

> @008karan @Palang2014 How are you getting the model to run on a GPU? Even with a GPU available, I'm only able to run on CPU. Mostly interested in running the fine-tuning, not the pre-training.

I had the same issue and realised it was because the program could not find the CUDA libraries. Check whether you get messages like "Successfully opened dynamic library libcudnn.so.7". If you don't, or if you see errors saying it couldn't find some CUDA libraries, you may have the same problem.

kukrishna avatar Jul 02 '21 11:07 kukrishna
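
For anyone who wants to run the library check described above without digging through the training logs, here is a minimal sketch using only the Python standard library. It tries to open each shared library by name, the same way the dynamic loader would. The function name `check_shared_libs` is made up for this example. `libcudnn.so.7` comes from the log message quoted above, while `libcudart.so.10.0` is only an assumption about which CUDA runtime your TensorFlow build expects, so adjust the names to match the errors you actually see.

```python
import ctypes


def check_shared_libs(names):
    """Try to load each shared library by name.

    Returns a dict mapping each name to True if the dynamic loader
    could open it, False otherwise. (Hypothetical helper, not part
    of ELECTRA or TensorFlow.)
    """
    results = {}
    for name in names:
        try:
            ctypes.CDLL(name)  # raises OSError if the loader cannot open it
            results[name] = True
        except OSError:
            results[name] = False
    return results


if __name__ == "__main__":
    # libcudnn.so.7 is taken from the log line in this thread;
    # libcudart.so.10.0 is an assumed name, change it for your setup.
    candidates = ["libcudart.so.10.0", "libcudnn.so.7"]
    for lib, found in check_shared_libs(candidates).items():
        print(f"{lib}: {'found' if found else 'NOT found'}")
```

If a library shows as not found on Linux, it is usually worth checking that the directory containing it is on `LD_LIBRARY_PATH`, since that is one of the places the loader searches.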