uis-rnn icon indicating copy to clipboard operation
uis-rnn copied to clipboard

Add support for estimation of crp_alpha

Open wq2012 opened this issue 5 years ago • 5 comments

Currently in this open source version, crp_alpha is passed in as an argument.

We need to add the support to estimate it from training data.

wq2012 avatar Nov 14 '18 21:11 wq2012

How to fix the total number of speakers? In most call center scenario, there are only 2 or 3 speakers.

fanlu avatar Jan 18 '19 08:01 fanlu

@fanlu The whole idea of UIS-RNN is to be able to handle unbounded number of speakers by learning from examples, instead of enforcing the number of speakers.

If you train UIS-RNN with call center audios where there are always 2 or 3 speakers, it should be able to predict at most 2 or 3 speakers, without requiring additional constraints.

However, since you asked, let me create a feature request issue for it. But likely we won't work on it for any time soon.

wq2012 avatar Jan 18 '19 15:01 wq2012

Hi, do you have any update about this issue? Or do you have any suggestion related to the input parameter adjustment when the system tends to add too many speakers?

suzinia avatar Aug 28 '19 21:08 suzinia

@suzinia Unfortunately no, since some core members have left the team.

You can try to locally apply #56 to constrain the number of speakers. It's not really very correct, but may solve your immediate problem.

wq2012 avatar Aug 28 '19 21:08 wq2012

Thanks, I'll try that out!

suzinia avatar Aug 28 '19 22:08 suzinia