PaddleSpeech

GPU out-of-memory error when training models

Open Logan-SUN opened this issue 1 year ago • 5 comments

General Question

I am trying to train an s2t (speech-to-text) model with PaddleSpeech on our own machine with 7 GPUs, but training always fails with an error saying there is no available GPU memory. I have set batch_size to 8, and the wav IDs are sorted by name. Our dataset contains 300+ million audio clips in total. Can anyone help me solve this GPU memory issue? Thanks.

Logan-SUN avatar Sep 01 '22 06:09 Logan-SUN

What is the approximate duration of each audio clip in your dataset? Very long input audio requires more GPU memory.

Zth9730 avatar Sep 02 '22 02:09 Zth9730
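One way to answer the duration question is to scan the dataset directly. The following is a minimal sketch, assuming the audio is stored as uncompressed PCM `.wav` files under a single directory; the function names and the 10-second limit are illustrative and not part of PaddleSpeech.

```python
import wave
from pathlib import Path


def wav_duration_seconds(path):
    """Return the duration of a PCM WAV file in seconds."""
    with wave.open(str(path), "rb") as wf:
        return wf.getnframes() / wf.getframerate()


def summarize_durations(wav_dir, limit_seconds=10.0):
    """Scan wav_dir recursively; return the longest duration found
    and the sorted list of files longer than limit_seconds."""
    durations = {p: wav_duration_seconds(p) for p in Path(wav_dir).rglob("*.wav")}
    too_long = sorted(p for p, d in durations.items() if d > limit_seconds)
    longest = max(durations.values(), default=0.0)
    return longest, too_long
```

Calling `summarize_durations("path/to/wavs")` on a sample of the data is usually enough to spot outliers; a few unexpectedly long clips can dominate the memory use of a padded batch.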

I have checked, and the maximum duration of our audio is less than 10 seconds. Thanks, by the way. The error I captured is as follows: image

Logan-SUN avatar Sep 02 '22 08:09 Logan-SUN

Are you using the same model structure and configuration that our repository provides? Did you add or remove any modules, or change the model size?

Zth9730 avatar Sep 02 '22 15:09 Zth9730

I used the same model structure and configuration as your pretrained model named asr1_conformer_wenetspeech_ckpt_0.1.1.model, which is published on the website. image

Logan-SUN avatar Sep 05 '22 07:09 Logan-SUN

Your GPU memory is not enough. Please reduce the batch size to fit your GPU device.

zh794390558 avatar Sep 05 '22 12:09 zh794390558
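The advice above can be sketched as a simple search: halve the batch size until training fits in memory. Gradient accumulation is not mentioned in this thread; it is added here as a common way to keep the number of utterances per optimizer update unchanged after shrinking the batch. The sketch assumes batch_size is counted per GPU and that the training config exposes some gradient-accumulation setting. Both helper names are hypothetical.

```python
def batch_size_schedule(start_batch):
    """Yield start_batch, then successive halvings down to 1,
    as candidate batch sizes to try after an out-of-memory error."""
    b = start_batch
    while b >= 1:
        yield b
        if b == 1:
            break
        b //= 2


def accumulation_steps(target_batch, per_gpu_batch, num_gpus=1):
    """Accumulation steps needed so that one optimizer update
    sees at least target_batch utterances across all GPUs."""
    per_step = per_gpu_batch * num_gpus
    return max(1, -(-target_batch // per_step))  # ceiling division


# Setup from this thread: 7 GPUs, batch_size 8 (assumed per GPU) -> 56 per update.
for b in batch_size_schedule(8):
    print(b, accumulation_steps(56, b, num_gpus=7))
```

Each printed pair is a batch size to try and the accumulation steps that would keep roughly 56 utterances per update; the first batch size that trains without the memory error is the one to keep.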