torch-pesq

Joint training SISDR and Pesq loss

Open Niteshkumarchaudhary opened this issue 1 year ago • 1 comments

Hi, what should the alpha value be for joint SI-SDR and PESQ loss training (SISDR_loss + alpha*pesq_loss)? I first checked the range of each loss (SI-SDR and PESQ), scaled them to the same range, and used alpha = 0.5 (equal weighting). After training, I observed that in the processed output, a significant amount of the speech signal also gets suppressed. Please comment and let me know.

I have another question about the PESQ loss: in `pesq = PesqLoss(val, sample_rate=44100)`, what is `val`? Is it just for scaling the loss value, or something else?

Niteshkumarchaudhary, Aug 13 '24 13:08

Hi! Thank you for your interest.

> I have another question about the PESQ loss: in `pesq = PesqLoss(val, sample_rate=44100)`, what is `val`? Is it just for scaling the loss value, or something else?

The value in that context is the loss weight. In the experiments, SI-SDR and PESQ had the same weighting (lamb=1/2).
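To relate lamb=1/2 to the `SISDR_loss + alpha*pesq_loss` form from the question, here is a minimal sketch. It assumes the weighting is a convex combination, `(1 - lamb) * sisdr + lamb * pesq`, which this thread does not spell out. The function names and the numeric loss values are made up for illustration and are not part of torch-pesq.

```python
import math


def convex_loss(sisdr: float, pesq: float, lamb: float) -> float:
    """Assumed convex form: (1 - lamb) * sisdr + lamb * pesq."""
    return (1.0 - lamb) * sisdr + lamb * pesq


def additive_loss(sisdr: float, pesq: float, alpha: float) -> float:
    """Form used in the question: sisdr + alpha * pesq."""
    return sisdr + alpha * pesq


def additive_alpha(lamb: float) -> float:
    """Alpha that makes the additive form proportional to the convex form."""
    if not 0.0 <= lamb < 1.0:
        raise ValueError("lamb must be in [0, 1)")
    return lamb / (1.0 - lamb)


# Made-up loss values, for illustration only.
sisdr_term, pesq_term = -12.0, 1.5

lamb = 0.5
alpha = additive_alpha(lamb)

# Both forms agree up to the global factor (1 - lamb), which only rescales
# the gradient and can be absorbed into the learning rate.
lhs = convex_loss(sisdr_term, pesq_term, lamb)
rhs = (1.0 - lamb) * additive_loss(sisdr_term, pesq_term, alpha)
print(alpha, math.isclose(lhs, rhs))
```

Under this assumption, equal weighting corresponds to alpha = 1 in the additive form, so alpha = 0.5 would give the PESQ term half the weight of the SI-SDR term rather than an equal share.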

> [..] in the processed output, a significant amount of the speech signal also gets suppressed. Please comment and let me know.

That depends on your model architecture; I can't comment on the performance without knowing it. For our model, the improvement was non-negligible but minor. You could run a hyperparameter search over your loss weights to find a good balance.
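A search over the loss weight could be sketched as below. This is only an outline under stated assumptions: `evaluate` stands in for a hypothetical routine that trains the model with a given weight and returns a validation score where higher is better, and `toy_score` is a placeholder so the sketch runs on its own. Neither is part of torch-pesq.

```python
import math
from typing import Callable, Dict, Iterable, Tuple


def sweep_weights(
    candidates: Iterable[float],
    evaluate: Callable[[float], float],
) -> Tuple[float, Dict[float, float]]:
    """Evaluate each candidate weight and return the best one with all scores."""
    results = {weight: evaluate(weight) for weight in candidates}
    best = max(results, key=results.get)
    return best, results


def toy_score(weight: float) -> float:
    """Placeholder for 'train with this weight, return a validation score'."""
    # Hypothetical score surface that peaks at weight = 0.1.
    return -abs(math.log10(weight) + 1.0)


# Log-spaced candidates cover several orders of magnitude with few runs.
candidates = [10.0 ** k for k in (-2, -1, 0, 1)]
best_weight, scores = sweep_weights(candidates, toy_score)
print(best_weight)
```

In practice the score would come from held-out data, for example a validation PESQ or SI-SDR, and a finer grid around the best coarse value can follow.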

bytesnake, Aug 14 '24 13:08

For further help, please reopen the issue

bytesnake, Sep 09 '24 14:09