reduce pydra overhead

Open satra opened this issue 2 months ago • 2 comments

in the senselab package which processes audio (mostly through python functions), we are seeing significant pydra overhead. to try and debug a bit of this, i looked at scalene (github.com/plasma-umass/scalene) and it seems like we could incorporate it into the codebase, or use it to optimize bits of it. there is an ai model integration into scalene.

Nov 06 '25 14:11 satra

Thanks Satra, I can take a look after SFN

Nov 06 '25 17:11 djarecka

As part of this issue, one of the other maintainers of senselab tried to do some performance testing and found the following when using 535 EMO-DB short clips

Task	With Pydra (s)	Without (s)
Read audios	12.5	0.5
Save audios	10.63	0.2
Downmix to mono	15.0	0.1
Resample	16.0	0.1
OpenSMILE features	33.0	20.0
Praat/parselmouth features	93.7*	246.1
Torchaudio features	37.8	16.2
Audiomentations (gain)	13.5	0.3
Torchaudiomentations (gain)	5.3	0.2

*on Pydra 0.23 it was even faster (~70s)

Nov 06 '25 18:11 wilke0818