Listening-to-Sound-of-Silence-for-Speech-Denoising icon indicating copy to clipboard operation
Listening-to-Sound-of-Silence-for-Speech-Denoising copied to clipboard

Inference Question

Open sstrelnikoff opened this issue 3 years ago • 4 comments

Could you provide more information as to how to run the inference models? For example, how do you modify the code to point to the dataset? Any extra details or examples would be greatly appreciated!

sstrelnikoff avatar Mar 03 '21 08:03 sstrelnikoff

I have the same problem, the explanations are so vague.

ghost avatar Mar 03 '21 14:03 ghost

If you want to use your own data, then follow the step to preprocess the data first. If you use the provided sample data, then you only need to modify the csv file so that it points to the correct wav file locations.

henryxrl avatar Mar 03 '21 20:03 henryxrl

Thanks for your response! I'm not sure what you mean by "modify the csv file so that it points to the correct wav file locations" since, to my understanding, the csv file is only meant to contain the names of the audio files and not their directories. How does predict.py know where the data is stored if the code isn't modified and it's not given as an input? Does it have to be in the directory structure given in the example?

sstrelnikoff avatar Mar 03 '21 21:03 sstrelnikoff

I have a issue about the pesq metrics. I see that you use the pypesq package to compute the pesq. But this package just can compute narrow band pesq version. The demand dataset is 16k sampling rate and the baseline model also provide the wide band pesq version. The two values will be different.

yunzqq avatar Jun 04 '21 07:06 yunzqq