metavoice-src
Foundational model for human-like, expressive TTS
MPS Support
Hi, Congrats on the launch! Is MPS (Apple Silicon) or MLX support planned? Thank you!
**Python 3.9.15** Running `sample.py` produced 3 typing errors. I removed the type annotations and it proceeded to the next step (Hugging Face credentials verification). Please specify Python...
Maybe one day Python devs will surprise me :) But seriously, need to add that - thanks!
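Until a supported version is documented, a guard at the top of the entry script would turn these typing errors into one explicit message. A minimal sketch, assuming Python 3.10 is the floor (that number is a guess, not something confirmed in this thread); `MIN_PYTHON` and `check_python` are hypothetical names, not part of the repo:

```python
import sys

# Assumption: 3.10 is the minimum supported version. Adjust once the
# maintainers state which versions they actually test against.
MIN_PYTHON = (3, 10)


def check_python(version=sys.version_info, minimum=MIN_PYTHON):
    """Raise a readable error if the interpreter is older than `minimum`."""
    found = tuple(version[:2])
    if found < minimum:
        raise RuntimeError(
            "Python {}.{}+ is required, found {}.{}".format(*minimum, *found)
        )


# Usage: call check_python() before any other project import.
```

With this in place, Python 3.9.15 would fail immediately with a version message rather than partway through import.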
Added Dockerfile & docker-compose.yml for containerized deployment. Had to make a slight adjustment to `servings.py`, changing the host IP from `127.0.0.1` to `0.0.0.0`. Hope this helps others with quick and convenient...
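The host change matters because a server bound to `127.0.0.1` inside a container only accepts connections from inside that container, so a published Docker port cannot reach it. One way to avoid hard-coding either value is to read the bind address from the environment. A minimal sketch; the `SERVING_HOST` variable and `resolve_host` helper are hypothetical, not names from the repo:

```python
import os

LOOPBACK = "127.0.0.1"      # reachable only from the same host or container
ALL_INTERFACES = "0.0.0.0"  # reachable through a published container port


def resolve_host(env=None):
    """Return the bind address, defaulting to loopback for local runs.

    `SERVING_HOST` is an assumed variable name, chosen for illustration.
    """
    env = os.environ if env is None else env
    return env.get("SERVING_HOST", LOOPBACK)


# Usage: set SERVING_HOST=0.0.0.0 in docker-compose.yml and pass
# resolve_host() to the server as its host argument.
```

Defaulting to loopback keeps local runs private, while the container opts in to listening on all interfaces.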
Hey! Thank you so so much for this repo and great work, this is what the world needs right now, I have been waiting for such a great foundation model...
Is improving the latency of the model on the roadmap, e.g. real-time TTS?
It shouldn't be hard for you. It can be ugly and badly coded; it just needs to work.
How much VRAM does the loaded model require? He said a 12GB RTX 4080 is not enough.
Guys, please specify the Python version that you've tested and pin specific versions in `requirements.txt`, e.g. with `pip3 freeze > requirements.txt`. I had to install 3 copies of `xformers`: ![xformers](https://github.com/metavoiceio/metavoice-src/assets/1624921/685c6bf6-2c4b-4953-9782-67ef2afea208)
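For anyone who wants to capture a working environment from inside Python, here is a rough equivalent of `pip3 freeze` using only the standard library. It is a sketch: unlike `pip`, it does not special-case editable or VCS installs, and `pinned_requirements` is a hypothetical helper name:

```python
from importlib.metadata import distributions


def pinned_requirements():
    """Return sorted 'name==version' lines for every installed distribution."""
    pins = set()
    for dist in distributions():
        name = dist.metadata.get("Name")
        if name:  # skip entries whose metadata is missing a name
            pins.add(f"{name}=={dist.version}")
    return sorted(pins, key=str.lower)


# Usage: write the result to a file to share a known-good environment.
# with open("requirements.txt", "w") as fh:
#     fh.write("\n".join(pinned_requirements()) + "\n")
```

Posting the resulting list alongside the Python version would let others reproduce the exact setup.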