VoiceCraft
VoiceCraft copied to clipboard
train.txt and validation.txt generation from extracted_codes_and_phonemes
Thanks for this amazing work to benefit the speech research community.
Just wondering, is the provided train.txt and validation extracted from the XL split of gigaspeech? In the manifest file, are the three columns "0 name codec_number"? Could you maybe also provide the script to generate them from the processed feature folder path/to/store_extracted_codes_and_phonemes please? Just in case someone wants to test it on a smaller dataset split or on a different dataset? Thank you.
Thanks!
I'm currently resolving paper reviews so wouldn't have the capacity to update this repo. But yes the three columns are "0 name codec_number", codec_number means how long is the codec sequence (for each codebook).
once you have the processed codes and phonemes, to generated the manifest, you just need to record the path and code sequence length.
Thanks for the explanation.
Hi @jasonppy @qingju-flwls I'm trying to do it on a different dataset; is it possible for any one of you to share the script to generate the train.txt and validation.txt Thanks