naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Results 16 naturalspeech2-pytorch issues

The same Microsoft group has released code for NaturalSpeech 3: https://speechresearch.github.io/naturalspeech3/ Could this help?

The first pitch, in `sample()`, is as follows: https://github.com/lucidrains/naturalspeech2-pytorch/blob/659bec7f7543e7747e809e950cc2f84242fbeec7/naturalspeech2_pytorch/naturalspeech2_pytorch.py#L1478-L1479 The second pitch, in the `forward()` of `NaturalSpeech2`, is as follows: https://github.com/lucidrains/naturalspeech2-pytorch/blob/659bec7f7543e7747e809e950cc2f84242fbeec7/naturalspeech2_pytorch/naturalspeech2_pytorch.py#L1543-L1556 1. Personally, I think the first pitch is from the...

Hi, thanks for the great job! I've followed the training process below and used pure audio without conditional input. I've tried a dataset with 110000+ audio files and also tried a dataset with...

![image](https://github.com/lucidrains/naturalspeech2-pytorch/assets/28752526/5c696602-a80e-4b37-988d-d2b591f8d863) Hi @lucidrains, I found that when step=1000, the system could save the .pt file automatically and output two audio files at the same time, but at this time, a...

In Usage: `loss = diffusion(raw_audio)` `loss.backward()` Thank you for your work, very nice! And I'm sorry, as a newbie, I have to ask two stupid questions: 1. Where does this...

Heya my lad, I have run your Usage code for generating speech, but nothing is output. Could you please give a full example that accepts a prompt and text, then generating...

Can you provide the training code for that model? ___ import torch from naturalspeech2_pytorch import Trainer, EncodecWrapper, Model, NaturalSpeech2, SpeechPromptEncoder codec = EncodecWrapper() def main(): model = Model( dim =...

Can you also share the inference code? Thank you.

Please let me know the format of the data to train the model and a detailed guide to train the model. Thank you.

Hello, I was looking into your code and it seems that the code does not take the duration_pitch_loss into account. https://github.com/lucidrains/naturalspeech2-pytorch/blob/659bec7f7543e7747e809e950cc2f84242fbeec7/naturalspeech2_pytorch/naturalspeech2_pytorch.py#L1522 Maybe it is related to the aux_loss you have made....
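
For context on the question above: an auxiliary loss influences training only if it is added into the scalar that `backward()` is called on. The sketch below illustrates that general idea in plain Python. It is not code from the repository; the function name `combine_losses`, its parameters, and the weights are all hypothetical, chosen only for illustration.

```python
# Hypothetical illustration, NOT code from naturalspeech2-pytorch.
# Every name and weight below is an assumption made for this sketch.

def combine_losses(diffusion_loss, duration_loss, pitch_loss,
                   duration_weight=1.0, pitch_weight=1.0):
    """Return the single scalar a training step would backpropagate.

    An auxiliary term affects training only if it appears in this sum.
    """
    aux_loss = duration_weight * duration_loss + pitch_weight * pitch_loss
    return diffusion_loss + aux_loss


# With both weights at zero, the auxiliary terms are effectively ignored.
ignored = combine_losses(0.5, 0.2, 0.3, duration_weight=0.0, pitch_weight=0.0)

# With the default weights, the auxiliary terms contribute to the total.
included = combine_losses(0.5, 0.2, 0.3)
```

If the returned total never includes the duration and pitch terms, those predictors receive no gradient from them, which is the situation the issue describes as a possibility.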