VoiceCraft icon indicating copy to clipboard operation
VoiceCraft copied to clipboard

train.txt and validation.txt generation from extracted_codes_and_phonemes

Open qingju-flwls opened this issue 1 year ago • 2 comments

Thanks for this amazing work to benefit the speech research community.

Just wondering, is the provided train.txt and validation extracted from the XL split of gigaspeech? In the manifest file, are the three columns "0 name codec_number"? Could you maybe also provide the script to generate them from the processed feature folder path/to/store_extracted_codes_and_phonemes please? Just in case someone wants to test it on a smaller dataset split or on a different dataset? Thank you.

qingju-flwls avatar Mar 27 '24 17:03 qingju-flwls

Thanks!

I'm currently resolving paper reviews so wouldn't have the capacity to update this repo. But yes the three columns are "0 name codec_number", codec_number means how long is the codec sequence (for each codebook).

once you have the processed codes and phonemes, to generated the manifest, you just need to record the path and code sequence length.

jasonppy avatar Mar 27 '24 17:03 jasonppy

Thanks for the explanation.

qingju-flwls avatar Mar 27 '24 18:03 qingju-flwls

Hi @jasonppy @qingju-flwls I'm trying to do it on a different dataset; is it possible for any one of you to share the script to generate the train.txt and validation.txt Thanks

meets2tarun avatar Oct 04 '24 08:10 meets2tarun