pyannote-audio
pyannote-audio copied to clipboard
finetuned pipeline takes way longer to run
Hi @hbredin , I was able to finetune the segmentation model as per the tutorial. I run it for 20 epochs now. However, now, the finetuned Speaker Diarization pipeline takes a very long time to run on my audio files as compared to the pretrained model. It seems to stop on the forward() / conv_forward() steps. Why is this?? I'm very confused. Takes hours.
Thank you for your issue. We found the following entries in the FAQ which you may find helpful:
- Can I apply pretrained pipelines on audio already loaded in memory?
- Can I use gated models (and pipelines) offline?
- How can I improve performance?
Feel free to close this issue if you found an answer in the FAQ.
If your issue is a feature request, please read this first and update your request accordingly, if needed.
If your issue is a bug report, please provide a minimum reproducible example as a link to a self-contained Google Colab notebook containing everthing needed to reproduce the bug:
- installation
- data preparation
- model download
- etc.
Providing an MRE will increase your chance of getting an answer from the community (either maintainers or other power users).
Companies relying on pyannote.audio
in production may contact me via email regarding:
- paid scientific consulting around speaker diarization and speech processing in general;
- custom models and tailored features (via the local tech transfer office).
This is an automated reply, generated by FAQtory
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.