AudioLDM "CUDA out of memory" when generating longer duration audio. Any way to fix?

"CUDA out of memory" when generating longer duration audio. Any way to fix?

Open asudyn opened this issue 1 year ago • 2 comments

I'm an enthusiast trying to see what is possible with this model and I can generate 30 second pieces on my RTX 3060 just fine but when I try 60 seconds I get this:

torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 6.75 GiB (GPU 0; 12.00 GiB total capacity; 10.34 GiB already allocated; 0 bytes free; 10.69 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

Is it possible to generate longer audio without buying a new GPU?

Apr 05 '23 05:04 asudyn

30 seconds is already great for a 3060. But if you want, you can reduce the -n N_CANDIDATE_GEN_PER_TEXT to the lowest value, 1. From my understanding, this option generates a set number of candidates. So if it's set to 5, then the single generation comes from 5 attempts who all use up lots of memory, and it tries to pick the best attempt to output.

However, I have experimented with this. By setting it to 1, I cannot consistently hear a quality difference, plus it becomes MUCH faster and uses MUCH less memory.

Apr 11 '23 10:04 CounterFlow64

The -n thing doesn't work I get the exact same oom issue with 3 and 1.

Apr 12 '23 05:04 gelukuMLG

AudioLDM AudioLDM copied to clipboard

"CUDA out of memory" when generating longer duration audio. Any way to fix?

AudioLDM
AudioLDM copied to clipboard