speech-language-model topic

List speech-language-model repositories

xcodec

95 stars, 3 forks

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

LLaMA-Omni

2.0k stars, 111 forks

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

WavTokenizer

1.2k stars, 102 forks, 1.2k watchers

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

SLED-TTS

104 stars, 7 forks, 104 watchers

Streamable Text-to-Speech model using a language modeling approach, without vector quantization

WavChat

310 stars, 17 forks, 310 watchers

A Survey of Spoken Dialogue Models (60 pages)

SoCodec

82 stars, 7 forks, 82 watchers

Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications