DiffSynth-Studio
DiffSynth-Studio copied to clipboard
use_gradient_checkpointing_offload occupies more VRAM when train i2v model
When training the I2V wanx model, using use_gradient_checkpointing_offload occupies more VRAM than using use_gradient_checkpointing. If you have time, could you please take a look? Thank you.