AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
The Hugging Face web demo is currently failing to output the result as a video file. Are there any plans to add a Google Colab notebook to the repository?
Could the authors kindly share the training code and dataset?
Hey all, I'm trying to run app.py following the README directions and am getting the following error on Ubuntu 23.04: `` DiffusionWrapper has 185.04 M params. /home/jerrick/anaconda3/envs/audioldm/lib/python3.8/site-packages/torchlibrosa/stft.py:193: FutureWarning:...
I followed the instructions but kept getting complaints that "Torch not compiled with CUDA enabled". I tried switching Torch to the CU variants, but that only resulted in cascading complaints...
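For anyone hitting the same "Torch not compiled with CUDA enabled" message, a quick diagnostic along these lines may help narrow it down. This is only a sketch: `pick_device` and `describe_build` are hypothetical helper names, not part of AudioLDM, and it assumes the error comes from a CPU-only PyTorch build rather than from AudioLDM itself.

```python
# Diagnostic sketch: report whether the installed PyTorch build can use CUDA.
# pick_device() and describe_build() are hypothetical helpers, not AudioLDM APIs.
try:
    import torch
except ImportError:  # PyTorch is not installed in this environment
    torch = None


def pick_device():
    """Return "cuda" when a CUDA-enabled PyTorch build sees a GPU, else "cpu"."""
    if torch is not None and torch.cuda.is_available():
        return "cuda"
    return "cpu"


def describe_build():
    """Summarize the PyTorch build; torch.version.cuda is None on CPU-only builds."""
    if torch is None:
        return "PyTorch is not installed"
    return f"torch {torch.__version__}, built with CUDA: {torch.version.cuda}"


if __name__ == "__main__":
    print(describe_build())
    print("Selected device:", pick_device())
```

If the summary shows `built with CUDA: None`, the installed wheel is a CPU-only build, which would be consistent with the error message quoted above.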
I did everything according to the instructions, but I still ran into this problem. My system is Windows 11. I then ran the command: python3 scripts/text2sound.py -t "A hammer is hitting...
Would love to see code to reproduce the paper's super resolution
I just did some basic formatting, syntax, and other minor fixes.
AudioLDM never finishes its audio generation; I don't even think it really starts. I waited over 2000 seconds, which is over 30 minutes, but nothing happens, just counting...
Hello, I am using the text_to_audio() function. In this line, https://github.com/haoheliu/AudioLDM/blob/6450a512e078c2c86b7aed86dadcc8964562fe59/audioldm/ldm.py#L151, why do you pass the text prompt two times to self.cond_stage_model?