Jakub Piotr Cłapa comments

Results 77 comments of


                                            Jakub Piotr Cłapa

Better support for zero-shot voice-cloning

Hey, I implemented the API we talked about. You can now pass a file or URL to the voice sample and it will be used to condition the model: ```...

Emotion markers

This is a nice idea. The trick seems to be getting good ground truth emotional speech samples and labels that don't sound fake. This is not currently supported in any...

Emotion markers

It’s best if you ask on the LAION Discord (link in the README).

CPU + MPS Support

CPU could be supported through whisper.cpp/llama.cpp but we are not working on that right now. MPS should work with minimal tweaks (there may be some hardcoded “cuda” settings).

CPU + MPS Support

@fakerybakery You can try and report back how difficult it is :) I don't have this on my roadmap right now (I am mostly focused on improving quality and language...

CPU + MPS Support

@BBC-Esq we are using [nbdev](https://nbdev.fast.ai/). it allows you to edit either the notebooks or the `.py` files and later synchronize the changes. I am on holiday next week but afterwards...

CPU + MPS Support

Regarding Vocos and MPS maybe it would be worth raising an issue on their GitHub and see what the author says? I was using this model as-is so I am...

CPU + MPS Support

The sdp_kernel is kind of important for performance on CUDA so we’d have to figure out how to make them transparent for MPS. Maybe make a new context manager that...

CPU + MPS Support

I was thinking about writing to the Vocos author since I believe sometimes the offending operations can be changed to something a little bit different that works out of the...

6. Gather more multi-lingual data

I think we need native speakers to ensure high quality material and build the best global open source TTS system. I am thinking of setting up a common format and...