AudioLDM-training-finetuning
Generating longer audio sequences
Hi! I was wondering how we could generate longer audio with a model that was trained to generate 10-second clips. Isn't the U-Net architecture fixed, so that the output dimensions are always the same? The diffusers pipeline seems to be changing the U-Net dimensions, but then don't we need to train it again?
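To make the question concrete, here is a minimal sketch of the underlying point, written in plain Python rather than with AudioLDM or diffusers code. It assumes a toy 1D convolution; the function `conv1d_same`, the kernel values, and the lengths 256 and 512 are all hypothetical stand-ins. It only illustrates that a convolution's learned weights do not depend on the input length, so the same weights can be applied to a longer input and give a longer output.

```python
def conv1d_same(signal, kernel):
    """Apply a 1D convolution with zero padding so the output length
    equals the input length (assumes an odd-length kernel)."""
    pad = len(kernel) // 2
    padded = [0.0] * pad + list(signal) + [0.0] * pad
    return [
        sum(w * padded[i + j] for j, w in enumerate(kernel))
        for i in range(len(signal))
    ]

# Hypothetical weights standing in for one trained convolutional layer.
kernel = [0.25, 0.5, 0.25]

# Hypothetical stand-ins for latents of a shorter and a longer clip.
short_latent = [1.0] * 256
long_latent = [1.0] * 512

# The same three weights are reused for both inputs; only the number of
# positions the kernel slides over changes.
short_out = conv1d_same(short_latent, kernel)
long_out = conv1d_same(long_latent, kernel)

print(len(kernel), len(short_out), len(long_out))
```

The sketch says nothing about output quality at lengths far from those seen in training; it only shows why a change in input size does not by itself require new weights.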
Thank you for your patience, Pranav