speech-processing topic
Voice2Series-Reprogramming
ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time Series Classification
RobustVC
**ICASSP 2022** "Toward Degradation-Robust Voice Conversion": using speech enhancement and end-to-end denoising training to improve the degradation and adversarial robustness of VC models.
SpeechPrompt
**Interspeech 2022** "SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks": speech processing with the prompting paradigm.
speechrec
A simple speech recognition app using the Web Speech API interfaces.
rte-speech-generator
Uses natural language processing to generate new speeches for the President of Turkey.
MASG
Microphone array speech generator (MASG) for room acoustics.
indic-num2words
Python library for converting numbers to words in all Indian languages.
speech2affective_gestures
This is the official implementation of the paper "Speech2AffectiveGestures: Synthesizing Co-Speech Gestures with Generative Adversarial Affective Expression Learning".
torchscale
Foundation Architecture for (M)LLMs
Virtual-Assistance-For-The-Blind
The proposed voice-based email system uses AI (voice commands) to make email easily accessible to visually challenged people and to help society. Accessibility is the most imp...