cryptowooser
In case anybody else has this error, you need to run the software from the base directory of the repository, not the \tha2\app directory.
I'm OOMing as well, is this model just really beefy? Using the defaults, running on linux with Triton etc. installed.
Depends on how much data you've got! I've gotten good results with Japanese with about 66 hours of labeled data.
I just did the segmentation model, but if there's a guide to finetuning the pipeline or the embedding model somewhere I'd love to see it! I'd love to improve the...
Yes, results were a MASSIVE improvement.
The metric was DER (diarization error rate). It dropped by, I want to say, 20-25 points? From 55% or so to the mid 30s. This was the methodology I used: pyannote-audio/tutorials/adapting_pretrained_pipeline.ipynb at develop · pyannote/pyannote-audio (github.com)...
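For anyone comparing numbers: here is a minimal sketch of how DER is defined and how a drop from about 55 to the mid 30s reads in absolute versus relative terms. The function names and the durations are made up for illustration. They are not from the notebook or from my actual runs, and this is plain arithmetic rather than the pyannote API.

```python
def diarization_error_rate(false_alarm, missed, confusion, total_speech):
    """DER = (false alarm + missed detection + speaker confusion) / total reference speech.

    All arguments are durations in the same unit (e.g. seconds).
    """
    if total_speech <= 0:
        raise ValueError("total_speech must be positive")
    return (false_alarm + missed + confusion) / total_speech


def improvement(before, after):
    """Return (absolute drop, relative drop) between two error rates."""
    absolute = before - after
    relative = absolute / before
    return absolute, relative


# Hypothetical durations in seconds, chosen only to illustrate the formula.
der = diarization_error_rate(false_alarm=30.0, missed=60.0,
                             confusion=120.0, total_speech=600.0)

# Rough figures from the comment above: about 55% before, mid 30s after.
abs_drop, rel_drop = improvement(0.55, 0.35)

print(f"DER: {der:.1%}")
print(f"absolute drop: {abs_drop:.1%}, relative drop: {rel_drop:.1%}")
```

With those rough figures, the absolute drop is about 20 points while the relative reduction is larger, so it is worth saying which one you mean when reporting results.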
No, I used the methodology described in the linked notebook. It worked pretty well, I was impressed! I'm interested in figuring out how I can get more gas out of...
Excellent, I would love to try. Is SpeechBrain a better choice than pyannote's own embedding models? I haven't looked at the embedding side of things much, so if there's more...
This would be very cool to have. Is there a pretrained checkpoint available for this somewhere?