voice-computing topic
voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
voicebook
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
allie
🤖 An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python 3.6 required.
download_audioset
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
voice_gender_detection
♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
sound_event_detection
🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
audioset_models
📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
nala
🦁 Nala is an agile open-source voice assistant framework (20+ actions).
pauses
🎤 A quick library to extract pause lengths from audio files.