TTS-WebUI
TTS-WebUI copied to clipboard
adding YuE
YuE has multiple layers of issues that make it a pain to integrate. For example, they use a modified vocos that obviously interferes with the existing vocos library. Then the installation requires cloning a 1.75gb repository and it's not clear how soon any fixes will be out of date.
Currently it appears that the inference code is quite convoluted, the repository is not a package and requires multiple fixes.
From issue: https://github.com/rsxdalv/tts-generation-webui/issues/434
Particularly problematic file: https://github.com/multimodal-art-projection/YuE/blob/main/inference/infer.py