speech-activity-detection topic
Datadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.
GPV
Repository for our Interspeech 2020 general-purpose voice activity detection (GPVAD) paper.
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
inaSpeechSegmenter
CNN-based audio segmentation toolkit. Detects speech, music, noise, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
VAD
Voice activity detection (VAD) toolkit including DNN-, bDNN-, LSTM-, and ACAM-based VAD. We also provide our directly recorded dataset.
kaldi
Fork of the official Kaldi repository.
speaker-change-detection
Speaker change detection using SincNet and an LSTM/Transformer
zff_vad
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
Depression-Engine
Detecting depressed patients based on speech activity and pauses in speech, using a deep learning approach.