speech-interaction topic

List of speech-interaction repositories

LLaMA-Omni

3.1k stars, 217 forks, 3.1k watchers

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

MooER

220 stars, 17 forks, 220 watchers

MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not lim...

OpenOmniNexus

36 stars, 4 forks, 36 watchers

A fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model.

OmniMMI

20 stars, 0 forks, 20 watchers

[CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts