speech-to-speech topic
awesome-speech-translation
rtvc
💬 "Realtime" voice transcription and cloning using ElevenLabs's API.
real-time-voice-translator
A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.
Echo-XI
Speech to text to speech using Elevenlabs
DASpeech
Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".
speech-to-speech
Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models"
Applio
A simple, high-quality voice conversion tool focused on ease of use and performance.
Soul-of-Waifu
If you've ever had the wish to talk to your AI Waifu using quality characters and voices for character voicing, then I suggest Soul of Waifu. Don't miss the opportunity to touch your dream!
LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.