im2wav

Official implementation of the pipeline presented in "I Hear Your True Colors: Image Guided Audio Generation"

3 im2wav issues

The sound I get by following the steps in the README with the pre-trained model is just noise. May I ask what the reason for this is?

I am having trouble installing packages! My stderr:

(im2wav) D:\miszczes\python\im2wav>pip install -r requirements.txt
Looking in indexes: https://pypi.org/simple, https://download.pytorch.org/whl/cu116
Collecting git+https://github.com/openai/CLIP.git (from -r requirements.txt (line 12))
Cloning https://github.com/openai/CLIP.git to c:\users\mszczesn\appdata\local\temp\pip-req-build-w709z4tz
Running...

When loading the wav_vq checkpoint, I encounter this error. Any suggestions on how I can fix it?