adelacvg

Results 45 comments of adelacvg

> hmm it seems my vqae training loss stuck, after 2 days, it stay the same,, the sample also not intagible from ground truth > > > Yes, just use...

> @adelacvg > > btw i change my vocab size gpt to 512, due multilinguality > > i just change the config > > ``` > "gpt":{ > "model_dim":768, >...

Ensure that the referenced mel is a short segment of audio to avoid GPT overfitting on the speaker's conditions. I have updated some parameters of the VQ-VAE, resulting in a...

Infer_diffusion function is the same as the infer function, `do_spectrogram_diffusion` part do the sample process.

For the GPT part, you can use acceleration frameworks similar to VLM, and they also support GPT2. For the diffusion part, you can adopt faster sampling methods with fewer sampling...