voice_activity_detection Performance issue in /vad/training (by P3)

Performance issue in /vad/training (by P3)

Open DLPerf opened this issue 2 years ago • 2 comments

Hello! I've found a performance issue in input_pipeline.py: dataset.batch(batch_size)(line 70) should be called before dataset.map(parse_func, num_parallel_calls=8)(line 69), which could make your program more efficient.

Here is the tensorflow document to support it.

Besides, you need to check the function parse_func called in dataset.map(parse_func, num_parallel_calls=8) whether to be affected or not to make the changed code work properly. For example, if parse_func needs data with shape (x, y, z) as its input before fix, it would require data with shape (batch_size, x, y, z) after fix.

Looking forward to your reply. Btw, I am very glad to create a PR to fix it if you are too busy.

Aug 27 '21 13:08 DLPerf

Hello, I'm looking forward to your reply~

Nov 04 '21 09:11 DLPerf

@DLPerf Where did you find the labeled dataset where the sequence is silence or voice activity.

If you have then could you please send me the link 😃

Feb 24 '22 10:02 dhirajnitk

voice_activity_detection voice_activity_detection copied to clipboard

Performance issue in /vad/training (by P3)

voice_activity_detection
voice_activity_detection copied to clipboard