asteroid icon indicating copy to clipboard operation
asteroid copied to clipboard

Using custom data for model training and testing

Open leandermaben opened this issue 2 years ago • 4 comments

I need to train and evaluate the ConvTasNet model on speech enhancement for a custom dataset. How should I go about formatting the data, and what changes need to be made in the run.sh file.

leandermaben avatar Apr 23 '22 21:04 leandermaben

Hey, thanks for the issue !

Have you had a look at already implemented speech enhancement dataset, how do they look like ?

mpariente avatar Apr 24 '22 18:04 mpariente

Hi, thanks for the response. By dataset, I don't mean the dataset class. I mean I want to use my own data ( clean and noisy .wav files) for training instead of wham or any of the other ones. I was hoping to do this directly using the ConvTasNet recipe by just changing the parameters in run.sh. Would this be possible?

leandermaben avatar Apr 24 '22 20:04 leandermaben

Not directly, not. Unless you create the json file that the WHAM dataset expects for your data, which is quite simple.

Otherwise, you need to create a dataset class for your own data, that's why I suggest you to look at how the datasets are implemented.

mpariente avatar Apr 25 '22 05:04 mpariente

Sure, I will look into it. Thank You.

leandermaben avatar Apr 25 '22 11:04 leandermaben