voicebox-pytorch issues

Mel model

10

May I ask whether this implementation of the model has been tested on mel spectrograms? I used a Transformer model with only convolutional positional encoding added at the beginning to...

lixuyuan102

Probably invalid infill logic

Hey, I am looking at the code, the paper, and Meta's demos, and I have a strong feeling that there is something wrong with the way infill is solved. On all demos...

ex3ndr

How to pick sigma?

3

Hey everyone, I am trying to figure out what value of σ (sigma) is meant to be used during training. There is no mention of a specific value in...

ex3ndr

Training Unconditional Model

15

Hello, First off, I just want to express my deep appreciation for everyone who put this repo together. It's very well done. I'm trying to pretrain the Voicebox model unconditionally...

nrocketmann

Training TTS

2

Could you please provide a script showing how to train Voicebox as a TTS model using duration predictors, phonemes, and so on?

Subuday

Training Example

19

The training example given seems to be missing the mask vector? In the paper the input to the model was the audio, mask and the phoneme sequence (which was aligned...

YKoustubhRao

speartts model

3

Hi there, I've tried building and saving the model using the speartts repo, but the structure of the output doesn't align. I have also tried matching the parameters in both. Is...

happy-machine

Where to get the kmeans_path = './path/to/hubert/kmeans.bin' file?

7

I tried to look for this file in the fairseq repo but couldn't find it. Could you please point out where we can download the kmeans.bin file? kmeans_path = './path/to/hubert/kmeans.bin'

furqan4545

Dtype Issues on Inference

Hello, First off, thanks for making this repo as well as all the other awesome work you do making high quality, open source implementations of SOTA ML papers! I've been...

nrocketmann

Samples for audio and steps to run the experiment?

3

Hey, great work. I am looking for some audio samples to check out the performance, or even a step-by-step guide to set it up and run it. Thanks!

RaiAmanRai
