wav2vec2 topic
audio-classification-pytorch
In this project, several approaches for training/finetuning an audio gender recognition is provided. The code can simply be used for any other audio classification task by simply changing the number o...
self-supervised-phone-segmentation
Phoneme segmentation using pre-trained speech models
zac2022-lyric-alignment
Solution for Zalo AI Challenge 2022 - Lyrics Alignment
wav2vec4bp
Wav2vec resources and models for Brazilian Portuguese
LLM-Minutes-of-Meeting
🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings, and get ready for Phase 2 where...
ASRAdversarialAttacks
An ASR (Automatic Speech Recognition) adversarial attack repository.
Wav2Vec2FBX
Recognize speech from an audio file and convert it into animation FBX
ShiftSER
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
wav2vec2-turkish
Turkish Speech Recognition using Facebook's Wav2vec 2.0 models
Wav2vec2.0
Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.