voice_activity_detection
Performance issue in /vad/training (by P3)
Hello! I've found a performance issue in input_pipeline.py: dataset.batch(batch_size)
(line 70) should be called before dataset.map(parse_func, num_parallel_calls=8)
(line 69). Batching first lets parse_func run once per batch instead of once per element, which could make your program more efficient.
The TensorFlow documentation supports this change.
Besides, you need to check whether the function parse_func
called in dataset.map(parse_func, num_parallel_calls=8)
is affected by this change, so that the modified code still works properly. For example, if parse_func
takes input of shape (x, y, z) before the fix, it will receive input of shape (batch_size, x, y, z) after the fix.
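To make the shape change concrete, here is a minimal sketch. It does not use the real parse_func from input_pipeline.py (I am assuming nothing about its body); parse_one and parse_batch are hypothetical stand-ins that subtract each example's mean, and the toy data and batch size of 3 are made up for illustration.

```python
import tensorflow as tf

# Toy data: 6 examples, each of shape (2, 2). Purely illustrative.
data = tf.reshape(tf.range(24, dtype=tf.float32), (6, 2, 2))

def parse_one(x):
    # Hypothetical per-element function: x has shape (2, 2).
    return x - tf.reduce_mean(x)

def parse_batch(x):
    # Batched version of the same logic: x has shape (batch_size, 2, 2),
    # so the mean is taken per example over axes 1 and 2.
    return x - tf.reduce_mean(x, axis=[1, 2], keepdims=True)

# Original order: map each element, then batch.
map_then_batch = (
    tf.data.Dataset.from_tensor_slices(data)
    .map(parse_one, num_parallel_calls=8)
    .batch(3)
)

# Proposed order: batch first, then map once per batch.
batch_then_map = (
    tf.data.Dataset.from_tensor_slices(data)
    .batch(3)
    .map(parse_batch, num_parallel_calls=8)
)

before = list(map_then_batch.as_numpy_iterator())
after = list(batch_then_map.as_numpy_iterator())
```

Both pipelines should yield the same batches here, but only because parse_batch was rewritten to reduce over the per-example axes. If parse_func instead parses serialized records with tf.io.parse_single_example, the batched version would probably need tf.io.parse_example, since the former expects a single scalar string.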
Looking forward to your reply. Btw, I'd be glad to create a PR to fix it if you are too busy.
Hello, I'm looking forward to your reply~
@DLPerf Where did you find the labeled dataset in which each sequence is marked as silence or voice activity?
If you have it, could you please send me the link? 😃