speech-representation topic

List speech-representation repositories

s3prl

2.1k
Stars
475
Forks

Self-Supervised Speech Pre-training and Representation Learning Toolkit

MiniASR

47
Stars
6
Forks

A mini, simple, and fast end-to-end automatic speech recognition toolkit.

Mockingjay-Speech-Representation

52
Stars
11
Forks

Official Implementation of Mockingjay in PyTorch

lighthubert

70
Stars
6
Forks

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

emotion2vec

428
Stars
33
Forks

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

WavTokenizer

1.2k
Stars
105
Forks
1.2k
Watchers

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

WavChat

314
Stars
17
Forks
314
Watchers

A Survey of Spoken Dialogue Models (60 pages)

MagiCodec

111
Stars
6
Forks
111
Watchers

A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.

dusted

18
Stars
0
Forks
18
Watchers

DUSTED: Spoken-Term Discovery using Discrete Speech Units