AudioLDM-training-finetuning Generating longer audio sequences.

Open Praniyendeev opened this issue 1 year ago • 0 comments

Hi! I was wondering how we could generate a longer audio sequence with the model trained to generate 10-second audio clips. Isn't the UNet architecture fixed, and hence the output dimensions the same? The diffusers pipeline seems to be changing the UNet dimensions, but then don't we need to train it again?

Thank you for your patience, Pranav

Jan 16 '24 18:01 Praniyendeev
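
As background for the question above, here is a minimal sketch (plain Python, not AudioLDM or diffusers code) of why a convolutional layer does not by itself fix the output size: the learned kernel has a fixed shape, but it slides over an input of any length. The function `conv1d_same` and the kernel values are hypothetical, chosen only for illustration. Whether audio quality holds at lengths the model never saw in training is a separate question that this sketch does not address.

```python
def conv1d_same(signal, kernel):
    """Slide one fixed, odd-length kernel over a signal of any length.

    Uses zero padding and stride 1, so the output has the same length
    as the input.
    """
    pad = len(kernel) // 2
    padded = [0.0] * pad + list(signal) + [0.0] * pad
    return [
        sum(k * padded[i + j] for j, k in enumerate(kernel))
        for i in range(len(signal))
    ]


# Hypothetical "trained" weights: the kernel size is fixed once and reused.
kernel = [0.25, 0.5, 0.25]

# The same weights applied to a short input and to one twice as long.
short = conv1d_same([1.0] * 8, kernel)
long = conv1d_same([1.0] * 16, kernel)

# The output length follows the input length, not the kernel.
print(len(short), len(long))
```

The same reasoning is why a fully convolutional network can, in principle, be run on a larger input without changing its weights; it says nothing about how well the result generalizes.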
