speech-interaction topic
List
speech-interaction repositories
LLaMA-Omni
3.1k
Stars
217
Forks
3.1k
Watchers
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
MooER
220
Stars
17
Forks
220
Watchers
MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not lim...
OpenOmniNexus
36
Stars
4
Forks
36
Watchers
a fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model.
OmniMMI
20
Stars
0
Forks
20
Watchers
[CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts