voice-activity-detection topic
Datadriven-GPVAD
Codebase for data-driven general-purpose voice activity detection.
GPV
Repository for our Interspeech 2020 general-purpose voice activity detection (GPVAD) paper.
Python-ai-assistant
Python AI assistant
ffsubsync
Automagically synchronize subtitles with video.
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
inaSpeechSegmenter
CNN-based audio segmentation toolkit. Detects speech, music, noise, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
VAD
Voice activity detection (VAD) toolkit including DNN-, bDNN-, LSTM-, and ACAM-based VAD. We also provide a dataset that we recorded ourselves.
NoiseTorch
Real-time microphone noise suppression on Linux.