voice-activity-detection topic
diart
A Python package to build AI-powered real-time audio applications
End-to-End-Speech-Recognition-Models
PyTorch implementation of automatic speech recognition models.
nala
🦁 Nala is an agile open-source voice assistant framework (20+ actions).
pauses
🎤 Quick library to extract pause lengths from audio files.
robust-vad
Lightweight CNN for Robust Voice Activity Detection
spokestack-ios
Spokestack: give your iOS app a voice interface!
spectra
Spectra extraction tutorials based on torch and torchaudio.
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.
itsp
Introduction to Speech Processing
whisper-auto-transcribe
Automatic transcription tool based on Whisper