naturalspeech2-pytorch issues

Results 16 naturalspeech2-pytorch issues

natural speech 3 code

The same Microsoft group has released code for NaturalSpeech 3: https://speechresearch.github.io/naturalspeech3/ Could this help?

jjsmcneil1113

two pitch?

The first pitch, in sample(), is as follows: https://github.com/lucidrains/naturalspeech2-pytorch/blob/659bec7f7543e7747e809e950cc2f84242fbeec7/naturalspeech2_pytorch/naturalspeech2_pytorch.py#L1478-L1479 The second pitch, in the forward() of NaturalSpeech2, is as follows: https://github.com/lucidrains/naturalspeech2-pytorch/blob/659bec7f7543e7747e809e950cc2f84242fbeec7/naturalspeech2_pytorch/naturalspeech2_pytorch.py#L1543-L1556 1. Personally, I think the first pitch is from the...

a897456

unconditional version seems not to work correctly

Hi, thanks for the great job! I've followed the training process below and used pure audio without conditional input. I've tried a dataset with 110000+ audio clips and also tried a dataset with...

ethanyhzhang

multiple GPU?

![image](https://github.com/lucidrains/naturalspeech2-pytorch/assets/28752526/5c696602-a80e-4b37-988d-d2b591f8d863) Hi, @lucidrains I found that when step=1000, the system could save the .pt file automatically and output two audio files at the same time, but at this time, a...

a897456

loss.backward()??

In Usage: `loss = diffusion(raw_audio)` `loss.backward()` Thank you for your work, very nice! And I'm sorry, as a newbie, I have to ask two stupid questions: 1. Where does this...

a897456

Question about generate speech

heya my lad, I have run your Usage code for generating speech, but nothing was output. Could you please give a full example to accept a prompt and text, then generating...

text2speechClone

@wonwooo

Can you provide the training code for that model? ___ import torch from naturalspeech2_pytorch import Trainer, EncodecWrapper, Model, NaturalSpeech2, SpeechPromptEncoder codec = EncodecWrapper() def main(): model = Model( dim =...

ksl103177

Inference code questions

Can you also share the inference code? Thank you.

ksl103177

Model study questions

Please let me know the format of the data to train the model and a detailed guide to train the model. Thank you.

ksl103177

Duration pitch loss not used.

Hello, I was looking into your code and it seems that the code does not use the duration_pitch_loss. https://github.com/lucidrains/naturalspeech2-pytorch/blob/659bec7f7543e7747e809e950cc2f84242fbeec7/naturalspeech2_pytorch/naturalspeech2_pytorch.py#L1522 It might be related to the aux_loss you have made....

jaykim9870
