KevinGengGavo
Also, I wonder whether `--norm_name self` in `stage 4` is necessary. Although you mentioned it in `README.md`, the default `norm_name` before `stage 3` was `ljspeech`, so there will only...
Hi, I’ve tried the `pwg_TXHC` vocoder after `stage 5`, and it's somehow working now. However, there are more artifacts after `stage 6` than I expected. Here, I have attached several...
Besides, a version conflict occurred during `stage 6` at `tools/venv/lib/python3.10/site-packages/s3prl_vc/upstream/ppg_sxliu/stft.py`, line 95. Here is my PyTorch env.
```
torch 2.0.1
torch-complex 0.4.3
torchaudio 2.0.2
```
Based on [torch.stft](https://pytorch.org/docs/2.0/generated/torch.stft.html?highlight=stft#torch.stft), the `return_complex`...
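For anyone hitting the same conflict, here is a minimal sketch of the kind of workaround discussed in this thread: call `torch.stft` with `return_complex=True` and convert the result back to the older real-valued layout with `torch.view_as_real`. This is not the actual code from `s3prl_vc`; the function name `stft_real_imag` and the window and FFT sizes are assumptions chosen only for illustration.

```python
import torch


def stft_real_imag(x, n_fft=512, hop_length=128, win_length=512):
    """Hypothetical helper: STFT returned as (Batch, Freq, Frames, 2=real_imag)."""
    window = torch.hann_window(win_length, device=x.device, dtype=x.dtype)
    # Recent PyTorch versions expect return_complex to be passed explicitly
    # for real inputs, so request a complex tensor here.
    spec = torch.stft(
        x,
        n_fft=n_fft,
        hop_length=hop_length,
        win_length=win_length,
        window=window,
        center=True,
        return_complex=True,
    )
    # view_as_real appends a trailing dimension of size 2 (real, imag),
    # matching the layout that older real-valued STFT code expects.
    return torch.view_as_real(spec)


wave = torch.randn(2, 16000)  # dummy batch: 2 waveforms, 16000 samples each
out = stft_real_imag(wave)
print(out.shape)
```

Downstream code that indexes `[..., 0]` and `[..., 1]` for the real and imaginary parts should keep working with this layout.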
Hi @unilight, I would appreciate it if you could take the time to look at this. Thank you.
Hi @Jasmijn888, thanks for your reply. I fixed this issue by adding `torch.view_as_real` at `tools/venv/lib/python3.10/site-packages/s3prl_vc/upstream/ppg_sxliu/stft.py`. Here are my lines 69 to 96: ``` # or (Batch, Channel, Freq, Frames, 2=real_imag)...
@Jasmijn888 I'm more curious about your output after stage 6. What does it sound like? > Hi, I’ve tried the `pwg_TXHC` vocoder after `stage 5`, and it's somehow working now....
Hi @unilight, > Hi @KevinGengGavo, I've tried to run the code on my local server again but I did not encounter the overflow issue. I can only suspect it's because...