voice-activity-detection topic
voxseg
A python library for voice activity detection (VAD) for speech/non-speech segmentation.
mica-speech-activity-detection
Robust Speech Activity Detection (SAD) in movie audio
Huawei-Challenge-Speaker-Identification
Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.
WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
zff_vad
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
vad-sli-asr
A pipeline to isolate and transcribe one language in mixed-language speech
whisper_ros
Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2
ASR-2Pass
ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).
android-speaker-audioanalysis
This is my Masters thesis project titled "Speaker Detection and Conversation Analysis on Mobile Devices".
Speaker-Diarization
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python