voice-activity-detection topic

List voice-activity-detection repositories

Datadriven-GPVAD

90
Stars
23
Forks
Watchers

The codebase for Data-driven general-purpose voice activity detection.

GPV

140
Stars
29
Forks
Watchers

Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper

ffsubsync

6.6k
Stars
265
Forks
Watchers

Automagically synchronize subtitles with video.

pyannote-audio

5.3k
Stars
705
Forks
Watchers

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

open-speech-corpora

1.2k
Stars
130
Forks
Watchers

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

voice_datasets

1.6k
Stars
221
Forks
Watchers

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

inaSpeechSegmenter

702
Stars
125
Forks
Watchers

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

VAD

825
Stars
229
Forks
Watchers

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

NoiseTorch

9.0k
Stars
228
Forks
Watchers

Real-time microphone noise suppression on Linux.