voicebox-pytorch
Implementation of Voicebox, a new SOTA text-to-speech network from Meta AI, in PyTorch
Mel model
May I ask whether this implementation of the model has been tested on the mel spectrum? I used a Transformer model with only convolutional positional encoding added at the beginning to...
Hey, I am looking at the code, the paper, and Meta's demos, and I have a strong feeling that there is something wrong with the way infill is solved. On all demos...
Hey everyone, I am trying to figure out what value of σ (sigma) is meant to be used during training. There is no mention of a specific value in...
Hello. First off, I just want to express my deep appreciation for everyone who put this repo together. It's very well done. I'm trying to pretrain the Voicebox model unconditionally...
Could you please provide a script showing how to train Voicebox as a TTS model using duration predictors, phonemes, and so on?
The training example given seems to be missing the mask vector? In the paper, the input to the model was the audio, the mask, and the phoneme sequence (which was aligned...
Hi there, I've tried building and saving the model using the speartts repo, but the structure of the output doesn't align. I have also tried matching the parameters in both. Is...
I tried to look for this file in the fairseq repo but couldn't find it. Could you please point out where we can download the kmeans.bin file? kmeans_path = './path/to/hubert/kmeans.bin
Hello. First off, thanks for making this repo as well as all the other awesome work you do making high quality, open source implementations of SOTA ML papers! I've been...
Hey, great work. I am looking for some audio samples to check out the performance, or even a step-by-step guide to set it up and run it. Thanks!!