Hervé BREDIN
I have been meaning to add this kind of progress hook for the [online demo](https://huggingface.co/spaces/pyannote/pretrained-pipelines) but it never really reached the top of my priority list. Those are the two...
FYI, I just [released](https://huggingface.co/pyannote/speaker-diarization) a much faster/more accurate version of `pyannote.audio` speaker diarization pipeline. It still does not expose the progress of the individual steps but this is now on...
Nothing built into `pyannote` comes to mind. You'd have to postprocess the [`pyannote.core.Annotation`](http://pyannote.github.io/pyannote-core/reference.html#annotation) instance returned by the pipeline:

1. remove any segment fully contained by a larger segment

   ```
   [------A-------]...
   ```
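Step 1 can be sketched on plain `(start, end)` tuples, such as the ones you would derive from `annotation.itersegments()`. This is a minimal illustration, not part of `pyannote` itself, and `remove_contained` is a hypothetical helper name:

```python
def remove_contained(segments):
    """Drop any segment fully contained in a strictly larger one.

    `segments` is a list of (start, end) tuples, e.g. derived from
    the segments of a pyannote.core.Annotation instance.
    """
    return [
        (start, end)
        for (start, end) in segments
        # keep a segment only if no *other* segment fully covers it
        if not any(
            other_start <= start and end <= other_end
            and (other_start, other_end) != (start, end)
            for (other_start, other_end) in segments
        )
    ]
```

You would then rebuild an `Annotation` from the surviving segments before moving on to the next post-processing step.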
That is a good idea, but soundata is not the only one in town :) There is also

* [lhotse](https://github.com/lhotse-speech/lhotse/tree/master/lhotse/recipes)
* [torchaudio](https://pytorch.org/audio/stable/datasets.html)
* [datasets](https://huggingface.co/datasets/librispeech_asr)
The difference between `rttm` and `lab` lies in the fact that

* `lab` format has no `filename` field; one `lab` file can therefore contain annotations for only one audio file....
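For illustration, here is what the two formats might look like side by side (file and speaker names are made up; the RTTM lines follow the usual `SPEAKER <file> <channel> <start> <duration> …` layout):

```
# RTTM: the second field names the audio file,
# so one file can mix several recordings
SPEAKER file1 1 0.00 3.20 <NA> <NA> alice <NA> <NA>
SPEAKER file2 1 1.50 2.00 <NA> <NA> bob   <NA> <NA>

# LAB: no filename field, so one .lab file per recording;
# each line is <start> <end> <label>
0.00 3.20 alice
3.20 5.20 bob
```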
Or maybe find a way to avoid this conversion (though it would probably result in slower training because of on-the-fly m4a decoding).
Did you `pip install pyannote.db.voxceleb`?
Thanks @hadware. Will look into this at some point...
# To maximise the probability of someone answering your question:

* if your issue is a bug report, please provide a [minimum reproducible example](https://stackoverflow.com/help/minimal-reproducible-example), e.g. a link to a self-contained...
Thanks. Would you mind sharing a link to the _RTTM file format specification_?