cryptowooser comments

Results 9 comments of


                                            cryptowooser

ModuleNotFoundError: No module named 'tha2'

In case anybody else has this error, you need to run the software from the base directory of the repository, not the \tha2\app directory.

How much VRAM do I need to run this on Gradio?

I'm OOMing as well, is this model just really beefy? Using the defaults, running on linux with Triton etc. installed.

Is the fine-tuning model applicable to other languages

Depends on how much data you've got! I've gotten good results with Japanese with about 66 hours of labeled data.

Is the fine-tuning model applicable to other languages

I just did the segmentation model, but if there's a guide to finetuning the pipeline or the embedding model somewhere I'd love to see it! I'd love to improve the...

Is the fine-tuning model applicable to other languages

Yes, results were a MASSIVE improvement. On Mon, Aug 28, 2023 at 6:13 PM Omar Sayed ***@***.***> wrote: > I just did the segmentation model, but if there's a guide...

Is the fine-tuning model applicable to other languages

Metric used was DER. It dropped I want to say... 20-25%? From 55 or so to mid 30s. This was the methodology I used. pyannote-audio/tutorials/adapting_pretrained_pipeline.ipynb at develop · pyannote/pyannote-audio (github.com)...

Is the fine-tuning model applicable to other languages

No, I used the methodology described in the linked notebook. It worked pretty well, I was impressed! I'm interested in figuring out how I can get more gas out of...

Is the fine-tuning model applicable to other languages

Excellent, I would love to try. Is Speechbrain a better choice than pyannote's own embedding models? I haven't looked at the embedding side of things much, so if there's more...

Feat/add wavlm based embeddings model

This would be very cool to have. Is there a pretrained checkpoint available for this somewhere?