adelacvg comments

Repositories
Issues
Comments

Results 45 comments of


                                            adelacvg

information on 24khz model

> hmm it seems my vqae training loss stuck, after 2 days, it stay the same,, the sample also not intagible from ground truth > > > Yes, just use...

information on 24khz model

> @adelacvg > > btw i change my vocab size gpt to 512, due multilinguality > > i just change the config > > ``` > "gpt":{ > "model_dim":768, >...

information on 24khz model

Ensure that the referenced mel is a short segment of audio to avoid GPT overfitting on the speaker's conditions. I have updated some parameters of the VQ-VAE, resulting in a...

information on 24khz model

Infer_diffusion function is the same as the infer function, `do_spectrogram_diffusion` part do the sample process.

information on 24khz model

For the GPT part, you can use acceleration frameworks similar to VLM, and they also support GPT2. For the diffusion part, you can adopt faster sampling methods with fewer sampling...