text-to-audio topic

List text-to-audio repositories

nuwa-pytorch

534
Stars
62
Forks
Watchers

Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch

WaveGrad2

66
Stars
16
Forks
Watchers

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

word2wave

116
Stars
16
Forks
Watchers

Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.

Amphion

4.0k
Stars
336
Forks
40
Watchers

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio,...

audio-webui

931
Stars
89
Forks
Watchers

A webui for different audio related Neural Networks

sub-to-audio

91
Stars
9
Forks
Watchers

Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS and synchronize the audio timing according to subtitle time.

audioldm-colab

20
Stars
3
Forks
Watchers

AudioLDM text to audio colab

soundstorm

21
Stars
7
Forks
Watchers

Soundstorm is a cutting-edge AI-powered audio manipulation application designed to provide a rich yet simplified experience for sound designers, algorithmic composers, and experimental audio enthusias...

tango

931
Stars
70
Forks
Watchers

A family of diffusion models for text-to-audio generation.

Auffusion

119
Stars
11
Forks
Watchers

Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"