audio-generation topic

List audio-generation repositories

AudioLDM

2.3k
Stars
215
Forks
20
Watchers

AudioLDM: Generate speech, sound effects, music and beyond, with text.

LocalAI

28.5k
Stars
2.1k
Forks
198
Watchers

The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, tr...

MM-Diffusion

372
Stars
22
Forks

[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

AudioLDM2

2.1k
Stars
167
Forks

Text-to-Audio/Music Generation

tango

931
Stars
70
Forks

A family of diffusion models for text-to-audio generation.

FunCodec

362
Stars
30
Forks

FunCodec is a research-oriented toolkit for audio quantization and downstream applications such as text-to-speech synthesis, music generation, etc.

Auffusion

119
Stars
11
Forks

Official code and models for the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

im2wav

102
Stars
9
Forks

Official implementation of the pipeline presented in "I Hear Your True Colors: Image Guided Audio Generation"

JEN-1-COMPOSER-pytorch

25
Stars
2
Forks

Unofficial implementation of JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation (https://arxiv.org/abs/2310.19180)

awesome-audio-plaza

344
Stars
13
Forks

Daily tracking of awesome audio papers, including music generation, zero-shot TTS, ASR, and audio generation