speech-detection topic
ffsubsync
Automagically synchronize subtitles with video.
subsync
Synchronize your subtitles using machine learning
inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
voice_activity_detection
Voice Activity Detection based on Deep Learning & TensorFlow
android-vad
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
bbc-speech-segmenter
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
RuntimeSpeechRecognizer
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.
edusense
EduSense: Practical Classroom Sensing at Scale
Speaker-Diarization
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python