Hervé BREDIN comments

Results 270 comments of


                                            Hervé BREDIN

Get rid of librosa dependency in favor of torchaudio

Related: https://pytorch.org/audio/stable/tutorials/audio_resampling_tutorial.html#comparison-against-librosa

Pre-loading background noise (and IR) in init?

Yes. Big RAM, slow I/O as well, here :)

Add support for spectrogram transforms

Thinking out loud: would it make sense to make no API distinction between feature extraction and augmentation? Spectrogram extraction could be seen as an augmentation of audio. It would allow...

Add a few more RIRs for demo/testing purposes

Related: https://github.com/yluo42/FRA-RIR

Interactive pyannote.audio.utils.preview

[Working on it](https://twitter.com/hbredin/status/1597262288146563072?s=20&t=ckBD2GGTexSYp-p-m9Plxg)

ArcFace embedding task is broken

@olvb should be fixed in `develop` branch.

Diarization and Database recipes

Closing as this will be integrated into [pyannotebook](https://github.com/hbredin/pyannotebook).

VAD Significant inconsistency in the results when using CPU and GPU

Could be related to #1370 if GPU was an Ampere (e.g. A100) device.

Useful multi-label models

[VocalSound](https://github.com/YuanGongND/vocalsound)

Useful multi-label models

[EpicSounds](https://github.com/epic-kitchens/epic-sounds-annotations)