speech-processing topic
SoundStorm-pytorch
Google's SoundStorm: Efficient Parallel Audio Generation
speech-adapters
Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech understanding
be_nlp_speech_resources
Links to Belarusian NLP and Speech resources
speech-emotion-recognition
A program that uses neural networks to detect emotions from pre-recorded and real-time speech
RuntimeSpeechRecognizer
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.
Multilingual-PR
Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three different self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) a...
ElevateAIJavaSDK
Java SDK for ElevateAI
ElevateAIDotNetSDK
.Net core 6 SDK for ElevateAI
voxseg
A python library for voice activity detection (VAD) for speech/non-speech segmentation.