speech-classification topic
soxan
Wav2Vec for speech recognition, classification, and audio classification
ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
ssast
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
Speech-Commands-Classification-by-LSTM-PyTorch
Classification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augmentation.
Transformer-based-SER
Transformer-based model for Speech Emotion Recognition(SER) - implemented by Pytorch
Toxic_Speech_Classification
It is a full-fetched web application.Based on sentiment classification, by using nltk library it predicts that a speech is how much toxic, sever toxic, insult, obscene, threat.
Audio-Mamba-AuM
Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"