voice-activity-detection topic

List voice-activity-detection repositories

voxseg

76
Stars
12
Forks
Watchers

A python library for voice activity detection (VAD) for speech/non-speech segmentation.

mica-speech-activity-detection

25
Stars
10
Forks
Watchers

Robust Speech Activity Detection (SAD) in movie audio

Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.

WhisperS2T

285
Stars
28
Forks
Watchers

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

zff_vad

18
Stars
1
Forks
Watchers

Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering

vad-sli-asr

18
Stars
3
Forks
Watchers

A pipeline to isolate and transcribe one language in mixed-language speech

whisper_ros

44
Stars
12
Forks
Watchers

Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2

ASR-2Pass

47
Stars
7
Forks
Watchers

ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).

android-speaker-audioanalysis

15
Stars
16
Forks
Watchers

This is my Masters thesis project titled "Speaker Detection and Conversation Analysis on Mobile Devices".

Speaker-Diarization

15
Stars
2
Forks
Watchers

Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python