zyy-fc comments

Results 9 comments of


zyy-fc

details about the noisy_train.mat and target_train.mat

I am really impressed by your work and got a few questions in terms of how you process the training data. Actually, I can not understanding the meaning of "Dimension:...

details about the noisy_train.mat and target_train.mat

Thank you for your reply. 1. To train the enhancement model, the input should be single-channel. However, is the input to enhancement model the one channel signal of multichannel signal...

details about the noisy_train.mat and target_train.mat

Could you tell me the version of tensorflow that you used?

the input to "bawn_pr_multi_gpu_train.py"

> @yuanyuan0209 You could simply convert them to the proper datatype. For inference, I used the "Fast WaveNet" algorithm, which is a little tricky to implement. > However, you could...

the input to "bawn_pr_multi_gpu_train.py"

> Hello, have you solved the first problem? I am also a rookie, could you tell me how to solve it? > Thanks! I am sorry and I have not...

> @XIEchoAH 您好。按照网址下载的噪音Nonspeech是20KHz的，在训练之前，我用Audition把它们重采样为16KHz. 另外，prepare_data.py中函数create_mixture_csv中read_audio没有加采样频率，如果音频不是16KHz，确实会造成如你所说的情况（第3点）。其他的就没做什么变动了。您好，打扰您一下！我最近在跑该开源代码，训练数据和测试数据都是按照作者所说的那样设置的，考虑到你们说的第3种情况，我也事先将数据重采样到16kHz。当第一次迭代次数没有修改时(10000)时，增强后的语音失真非常严重，PESQ值在0.4左右，现在设置迭代次数为50000，但是训练过程中发现test loss一直降不下来，想问下你们有遇到过这种情况吗？谢谢！ ![1](https://user-images.githubusercontent.com/32134929/48313910-7fe4fc80-e5fd-11e8-92a9-bb4a6924a647.png)

复现代码结果不理想

> 您好，在使用了您的代码后，我有一些地方不确定，期待您能在百忙之中抽空解答一下： > > 1. 测试代码时，使用的语音和噪声数据与您所给一致且采样频率为16KHz，所有参数未做修改，train_loss与您所给一致，但test_loss却偏大，如图所示 > ![1](https://user-images.githubusercontent.com/29767168/39418213-ae5292f0-4c8b-11e8-9659-838fa41b161f.jpg) > > > PESQ结果如下，仅为1.83 > ![qq 20180430150456](https://user-images.githubusercontent.com/29767168/39417683-955b8e26-4c88-11e8-870c-77fa97e95629.jpg) > > 在查看代码后冒昧地问一下是不是您的代码只是DNN的baseline，并没有NAT ，dropout和post-processed的部分，所以复现代码时的结果不良好，如有还望您指出在哪一部分。 > > 1. 在prepare_data.py中，read_audio的部分有resample的操作，想问一下如果我的原始语音和噪声都不是16KHz的，resample之后再训练和测试和直接用16KHz的数据训练和测试的实验结果会有差别么？ > > 2. 在prepare_data.py中，您的操作是先生成了mixture_csvs，然后按照mixture_csvs中的noise_onset...

复现代码结果不理想

> Thanks for your interest. 这个代码目前仅仅是个baseline，如果性能不如 baseline性能2.08+-0.24，请检查下1）“PESQ dose not support long path/folder name, so please shorten your path/folder name. ” ； 2）一些配置问题这个代码里没有加 NAT, Post-processing，有这些功能的模型（可测试任何语音)： https://github.com/yongxuUSTC/DNN-Speech-enhancement-demo-tool Best regards yong...