electra Multi-GPU training

Hi Kevin,

Thanks for the great work and for releasing the code and models. I was wondering whether you have tried multi-GPU training for ELECTRA-base and ELECTRA-large (does the current code support multi-GPU?), and whether you have stats for multi-GPU experiments as well.

I'd also be interested in the stats for single-GPU training of ELECTRA-base and ELECTRA-large (how many days are needed until they converge to decent performance?).

Thanks! -Hamid

Mar 13 '20 00:03 hamidpalangi

I've tried the current starter code for pretraining a small model. It seems the model is trained on a single GPU only.

Mar 13 '20 08:03 zxybazh

Yes, it runs on a single GPU. Looking for multi-GPU support. @clarkkev

Mar 17 '20 07:03 008karan

Gentle follow-up, Kevin: any thoughts?

Thanks, -Hamid

Mar 23 '20 18:03 hamidpalangi

Are there any plans for multi-GPU support, @clarkkev?

May 13 '20 16:05 008karan

Hoping for multi-GPU support and a detailed configuration. @clarkkev

May 21 '20 15:05 MarkClemens301

@008karan @Palang2014 How are you getting the model to run on a GPU? Even with a GPU available, I'm only able to run on CPU. Mostly interested in running the fine-tuning, not the pre-training.

Sep 13 '20 15:09 sadhikamalladi

> @008karan @Palang2014 How are you getting the model to run on a GPU? Even with a GPU available, I'm only able to run on CPU. Mostly interested in running the fine-tuning, not the pre-training.

I had the same issue and realised it was because the program could not find the CUDA libraries. Check whether you get messages like "Successfully opened dynamic library libcudnn.so.7". If you don't, or if you see errors saying it couldn't find some CUDA libraries, you may have the same problem.

Jul 02 '21 11:07 kukrishna
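
For anyone wanting to run the check described above without starting a training job, here is a minimal sketch that asks the dynamic loader whether it can open the CUDA libraries. Only `libcudnn.so.7` comes from the log message quoted above; the other library names and the helper `check_libs` are assumptions for illustration and will likely need adjusting to the CUDA version on your machine.

```python
import ctypes

# Library names are assumptions: libcudnn.so.7 is taken from the log line
# quoted above; the other two are typical of a CUDA 10.0 install and may
# differ on your system.
CANDIDATE_LIBS = ["libcudart.so.10.0", "libcublas.so.10.0", "libcudnn.so.7"]


def check_libs(names):
    """Return {name: bool} saying whether the dynamic loader can open each library."""
    results = {}
    for name in names:
        try:
            ctypes.CDLL(name)  # raises OSError if the library cannot be loaded
            results[name] = True
        except OSError:
            results[name] = False
    return results


if __name__ == "__main__":
    for name, ok in check_libs(CANDIDATE_LIBS).items():
        print(f"{name}: {'found' if ok else 'NOT found'}")
```

If a library is reported as not found, a common cause is that its directory is missing from the loader's search path (for example `LD_LIBRARY_PATH` on Linux), though this sketch does not diagnose the reason.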
