audio-generation topic
SpecVQGAN
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
Catch-A-Waveform
Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
word2wave
Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.
awesome-sound_event_detection
Reading list for research topics in Sound AI
audio-data-pytorch
A collection of useful audio datasets and transforms for PyTorch.
audio-diffusion-pytorch-trainer
Trainer for audio-diffusion-pytorch
audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
soundstorm-pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio,...