deep-clustering Test audio sample clipped.

Test audio sample clipped.

Open kchanqvq opened this issue 7 years ago • 4 comments

I'm using WSJ0 dataset, and the model convergences. When I use the utility audio_test.py, however, I get only 3 second output when I feed it a 5 second mixed audio. Is that any issue in this python script causing the audio to be clipped?

Jan 04 '18 17:01 kchanqvq

Yeah, the test script just uses 3 chunks of frames. You can change that by modifying the AudioSampleReader.gen() into a python generator and iterate through all the frames in audio_test.py.

Jan 05 '18 03:01 zhr1201

@zhr1201 Have you ever encountered audio files that cannot be played after voice separation？ I find the separate wav file's format as follow: Input File : 'out_1.wav' Channels : 1 Sample Rate : 16000 Precision : 53-bit Duration : 00:00:01.61 = 25792 samples ~ 120.9 CDDA sectors File Size : 206k Bit Rate : 1.02M Sample Encoding: 64-bit Floating Point PCM

when convert the former wav format as follow, which has't voice. my model is: 链接:https://pan.baidu.com/s/1jWo9fgoaTTN7Akeh5RbupA 密码:ax1s

Can you give me some advice?

Channels : 1 Sample Rate : 8000 Precision : 16-bit Duration : 00:00:03.13 = 25055 samples ~ 234.891 CDDA sectors File Size : 50.2k Bit Rate : 128k Sample Encoding: 16-bit Signed Integer PCM

Apr 08 '18 08:04 dearwind153

@dearwind153 All the sample point are all 0 in your file. Guess some error happened and I am not very sure. You probably should run that step by step to see what is wrong.

Apr 09 '18 01:04 zhr1201

@zhr1201 , Ok, thank you for your reply!

Apr 09 '18 01:04 dearwind153

deep-clustering deep-clustering copied to clipboard

Test audio sample clipped.

deep-clustering
deep-clustering copied to clipboard