RealBasicVSR icon indicating copy to clipboard operation
RealBasicVSR copied to clipboard

model training problem

Open RavenKang opened this issue 2 years ago • 3 comments

When trying to train on multiple GPUs the error:

ValueError: You may use too small dataset and our distributed sampler cannot pad your dataset correctly. We highly recommend you to use fewer GPUs to finish your work

I follow the instructions below: Put the original REDS dataset in ./data Run the following command: python crop_sub_images.py --data-root ./data/REDS --scales 4

and training model follow the instructions mim train mmedit configs/realbasicvsr_wogan_c64b20_2x30x8_lr1e-4_300k_reds.py --gpus 2 --launcher pytorch

RavenKang avatar May 31 '22 07:05 RavenKang

How many GPUs are you using?

ckkelvinchan avatar Jun 02 '22 22:06 ckkelvinchan

How many GPUs are you using?

I use 2 GPUs

RavenKang avatar Jun 08 '22 02:06 RavenKang

I also encountered the same problem and solved it in the following ways Modify code in realbasicvsr_wogan_c64b20_2x30x8_lr1e-4_300k_reds.py workers_per_gpu=4, num_input_frames=8,

zlu1994 avatar Jun 08 '22 03:06 zlu1994