music-source-separation

Can this architecture be used for speech enhancement in speech recognition?

Open sigpro opened this issue 8 years ago • 5 comments

Can this architecture recover clean speech from noisy speech?

sigpro avatar Jul 12 '17 01:07 sigpro

@sigpro The same architecture can also be used for removing noise from speech or for extracting a single speaker's speech from a dialog ;) Take a look at this reference paper: https://posenhuang.github.io/papers/Joint_Optimization_of_Masks_and_Deep%20Recurrent_Neural_Networks_for_Monaural_Source_Separation_TASLP2015.pdf
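
To illustrate why the approach carries over to denoising, here is a minimal sketch of the soft time-frequency masking step described in the reference paper, with the two sources taken to be speech and noise. It is not code from this repository: the function name `apply_soft_masks`, its arguments, and the toy random spectrograms are assumptions made for the example, and in practice the two estimates would come from the network.

```python
import numpy as np


def apply_soft_masks(mix_mag, est_speech_mag, est_noise_mag, eps=1e-8):
    """Split a mixture magnitude spectrogram with a soft time-frequency mask.

    All arrays share one shape, e.g. (frames, frequency_bins).
    Hypothetical helper for illustration only.
    """
    speech_abs = np.abs(est_speech_mag)
    noise_abs = np.abs(est_noise_mag)
    # Fraction of each time-frequency bin attributed to speech, in [0, 1].
    mask = speech_abs / (speech_abs + noise_abs + eps)
    speech = mask * mix_mag
    # The remainder of each bin goes to noise, so the two outputs sum to the mixture.
    noise = (1.0 - mask) * mix_mag
    return speech, noise


# Toy data standing in for network outputs (assumption: random values).
rng = np.random.default_rng(0)
mix = rng.random((5, 4))
speech_est = rng.random((5, 4))
noise_est = rng.random((5, 4))

speech, noise = apply_soft_masks(mix, speech_est, noise_est)
```

Because the mask only redistributes the mixture's energy between two outputs, nothing in this step is specific to singing voice; what changes between tasks is the training data for the two targets.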

andabi avatar Jul 12 '17 01:07 andabi

@andabi Thank you for the reply. Will you try multi-channel input for better performance?

sigpro avatar Jul 15 '17 06:07 sigpro

@sigpro Yes, I'll try it if I have a dataset ;) Do you know of any multi-channel dataset?

andabi avatar Jul 15 '17 06:07 andabi

@andabi I think the CHiME challenge dataset can be used.

sigpro avatar Jul 15 '17 08:07 sigpro

Just to extend this question: how do I modify the code to make it work with any dataset? Or is the code written specifically for singing voice separation? It would be awesome if this were a universal solution for many source separation tasks.

ghost avatar Jul 16 '17 21:07 ghost