Hervé BREDIN
Hervé BREDIN
This PR now has conflicts.
Can you please share the output of `print(waveform.shape, sr, waveform.shape[1] / sr)`?
Merged but I think there is still some work needed on this tutorials. For instance (and I did not go over everything, far from it), this sentence "The pipeline will...
Would definitely be easier and faster if you shared the audio file and a Google Colab I can just run... In the meantime, yes, `diarization.crop(...)` should do the trick.
This is not (yet) supported.
Would you mind sharing a link to a Google Colab that one can just click and run to reproduce the issue?
Adding `cannot_reproduce` label because, well, I cannot reproduce it.
Not in 3.x, no. I am considering adding back the option but cannot provide an ETA though. Can you say more about your use case?
I would then use `pyannote/segmentation` for this purpose, wrapped in a voice activity detection pipeline that comes with onse/offset thresholds: https://huggingface.co/pyannote/segmentation#voice-activity-detection