speech-representation topic
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
MiniASR
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
Mockingjay-Speech-Representation
Official implementation of Mockingjay in PyTorch
lighthubert
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
WavTokenizer
[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling
WavChat
A Survey of Spoken Dialogue Models (60 pages)