SpeechSplit icon indicating copy to clipboard operation
SpeechSplit copied to clipboard

data preprocessing and final loss value

Open janeyeon opened this issue 5 years ago • 9 comments

Hi. I wanted to ask if you performed data normalization of an audio after trimming all the silences! And if you did, what method did you use? (maybe link to a paper or lecture or some package, please?)

What was final validation loss of G and P after training is almost done? My result is something like this.. and I'm not sure if it's an okay number. image

janeyeon avatar Aug 27 '20 02:08 janeyeon

Please refer to the data preparation code for details

auspicious3000 avatar Aug 28 '20 13:08 auspicious3000

@janeyeon how where you able to train both G and P at the same time? willing to share your code?

steven850 avatar Oct 09 '20 13:10 steven850

You don't need to train them at the same time.

auspicious3000 avatar Oct 09 '20 13:10 auspicious3000

Is there code available to train P, Im only seeing G in the solver.py

steven850 avatar Oct 09 '20 14:10 steven850

You can simply replace G with P along with some other minor modifications.

auspicious3000 avatar Oct 09 '20 14:10 auspicious3000

Hi . Did you do some preprocessing such as trimming silence and cropping in the training data? Or Have you done some other changes in the code...? In my expeiment, the final validation loss is much higher than yours. I want to find the reason... Thank you in advance.

XintaoZhao0805 avatar Oct 26 '20 09:10 XintaoZhao0805

All preprocessing steps are in the code, except trimming silence. But I don't think they will make any fundamental difference. Your loss value looks fine.

auspicious3000 avatar Oct 26 '20 09:10 auspicious3000

@janeyeon @steven850 any ideas how to fix this problem?

thank you

https://github.com/auspicious3000/SpeechSplit/issues/28

FurkanGozukara avatar Jan 17 '21 13:01 FurkanGozukara

Can you share this code of yours? Thank you very much!

9527950 avatar Mar 06 '23 01:03 9527950