What kind of GPU do you use to train svd_xt at 1024x576 resolution with 25 frames?

Open chenbinghui1 opened this issue 1 year ago • 3 comments

I find that with this configuration, a single 80GB A100 is not enough even for batch_size=1. Am I right?

Feb 07 '24 04:02 chenbinghui1

As mentioned in the report "Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets", eight 80GB A100 GPUs were used for training.

Mar 19 '24 23:03 fyang064

I also tried training with LoRA, and even with a batch size of 1 it still ran out of memory. It seems that model-parallel training is needed.
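
A rough back-of-envelope sketch of why LoRA alone may not be enough here: LoRA shrinks the gradient and optimizer state, but the activations for 25 frames at 1024x576 still have to be kept for the backward pass. The numbers below are assumptions, not measurements: an 8x VAE downsampling factor with 4 latent channels, roughly 1.5B UNet parameters, fp32 Adam, and a hypothetical 1% trainable fraction for LoRA. The helper names are made up for illustration.

```python
def latent_shape(width, height, frames, vae_factor=8, latent_channels=4):
    """Return (frames, channels, h, w) of the video latent.

    vae_factor and latent_channels are assumed values, not read from a config.
    """
    return (frames, latent_channels, height // vae_factor, width // vae_factor)


def training_state_gib(num_params, trainable_fraction=1.0, weight_bytes=4):
    """Weights + gradients + Adam moments in GiB. Activations are NOT included."""
    trainable = num_params * trainable_fraction
    weights = num_params * weight_bytes  # all weights stay resident
    grads = trainable * 4                # fp32 gradients for trainable params only
    adam = trainable * 8                 # two fp32 Adam moments per trainable param
    return (weights + grads + adam) / 2**30


ASSUMED_UNET_PARAMS = 1.5e9  # assumption: approximate size of the video UNet

shape = latent_shape(1024, 576, 25)
tokens_per_frame = shape[2] * shape[3]     # spatial positions per frame
total_tokens = shape[0] * tokens_per_frame # positions across all 25 frames

full_gib = training_state_gib(ASSUMED_UNET_PARAMS)
# Hypothetical LoRA setup: fp16 frozen weights, about 1% of params trainable.
lora_gib = training_state_gib(ASSUMED_UNET_PARAMS, trainable_fraction=0.01, weight_bytes=2)

print("latent shape:", shape)
print("tokens per frame:", tokens_per_frame, "| total:", total_tokens)
print(f"full fine-tune state: {full_gib:.1f} GiB, LoRA state: {lora_gib:.1f} GiB")
```

Under these assumptions the weight and optimizer state fits comfortably in 80GB either way, so the remaining pressure would come from activations over the roughly 230k latent positions per sample. That suggests trying gradient checkpointing, fewer frames, or a lower resolution before moving to model parallelism.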

May 08 '24 16:05 DataAIPlayer