speech-commands topic
ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Wav2Keyword
Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.
nyumaya_audio_recognition
Classify audio with neural nets on embedded systems like the Raspberry Pi
audiossl
A library built for easier audio self-supervised training, downstream tasks evaluation
speech-recognition-transfer-learning
Speech command recognition DenseNet transfer learning from UrbanSound8k in keras tensorflow
kws-attention
Attention-based model for keywords spotting
DiffWave-unconditional
Pytorch Reimplementation of DiffWave unconditional generation: a high quality waveform synthesizer.
tensorflow-speech-recognition-challenge
Kaggle Competitions: TensorFlow Speech Recognition Challenge
BiFSMN
Pytorch implementation of BiFSMN, IJCAI 2022