FullSubNet-plus

Control the strength of Enhancement

Open v-nhandt21 opened this issue 3 years ago • 3 comments

Can we control the enhancement strength of FullSubNet through this config?

image

v-nhandt21 · Oct 17 '22 10:10

Sorry, I don't quite understand your question. Generally speaking, the enhanced speech has the same length as the input. If you want to change that, maybe you can adjust the input or output length?

RookieJunChen · Oct 20 '22 03:10

Sorry, I meant: can we control the degree of enhancement? In some cases I need a little bit of noise to remain so that the speech sounds more natural.

v-nhandt21 · Oct 20 '22 08:10

Oh, I see. Our model is end-to-end, so the degree of noise reduction depends on the data seen during training. If you want to reduce the degree of speech suppression, you can try training on data with a higher SNR, or train the model with an asymmetric loss [1].

[1] Quan Wang et al. "VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition." Conference of the International Speech Communication Association (Interspeech), 2020.

RookieJunChen · Oct 20 '22 11:10
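
For readers who want to try the asymmetric loss suggestion, here is a minimal NumPy sketch of the idea as I understand it from [1]: the error between power-compressed clean and enhanced magnitude spectrograms is weighted more heavily wherever the enhanced spectrum falls below the clean one (over-suppression). This is not code from the FullSubNet-plus repository. The function name `asymmetric_l2_loss`, the default `alpha`, and the use of a mean instead of a sum are my own assumptions for illustration, and a real training setup would need the same logic in the framework's tensor operations.

```python
import numpy as np


def asymmetric_l2_loss(clean_mag, enhanced_mag, alpha=10.0, power=0.3):
    """Asymmetric L2 loss on power-compressed magnitude spectrograms.

    Hypothetical helper, not part of FullSubNet-plus.
    clean_mag, enhanced_mag: non-negative arrays of the same shape.
    alpha: penalty factor for over-suppression (assumed default).
    power: compression exponent applied to the magnitudes.
    """
    clean_mag = np.asarray(clean_mag, dtype=np.float64)
    enhanced_mag = np.asarray(enhanced_mag, dtype=np.float64)

    # Error in the compressed magnitude domain.
    diff = np.power(clean_mag, power) - np.power(enhanced_mag, power)

    # diff > 0 means the enhanced spectrum is below the clean one,
    # i.e. the model removed too much; scale that error by alpha.
    weighted = np.where(diff > 0, alpha * diff, diff)

    # Mean instead of sum so the value does not depend on the array size.
    return float(np.mean(weighted ** 2))
```

With `alpha=1.0` this reduces to an ordinary L2 loss on compressed magnitudes. Values above 1 push the model toward leaving some residual noise rather than removing speech, which seems close to what the question asks for, though how large `alpha` should be would have to be tuned.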