warp-rnnt

Question about average_frames and reduction params

Open wl-junlin opened this issue 3 years ago • 1 comment

I want a stable loss that is robust to labels_lengths during training. What values should I pass to these two params?

Also, what is the approximate relationship between the loss and the actual WER? For example, if I want a WER of around 0.5, what should the loss value be?

wl-junlin, Sep 10 '21 07:09

You shouldn't average over frames. If I remember correctly, it doesn't make sense theoretically, because the loss is calculated for the entire utterance.
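
To make the difference concrete, here is a minimal plain-Python sketch of what a batch reduction and a per-frame average do to per-utterance losses. The loss values, frame counts, and the `reduce_losses` helper are all made up for illustration. Nothing here calls warp-rnnt; it only mimics what `reduction` and `average_frames` are understood to control.

```python
# Hypothetical per-utterance RNN-T losses (negative log-likelihoods) and
# frame counts. These numbers are invented, not measured.
utterance_losses = [12.0, 30.0, 45.0]
frame_counts = [100, 250, 500]


def reduce_losses(losses, reduction="mean"):
    """Combine per-utterance losses into a single scalar for the batch."""
    if reduction == "sum":
        return sum(losses)
    if reduction == "mean":
        return sum(losses) / len(losses)
    raise ValueError(f"unknown reduction: {reduction!r}")


# Whole-utterance losses, averaged over the batch (no frame averaging).
batch_mean = reduce_losses(utterance_losses, "mean")

# Frame-averaged variant: each utterance loss is divided by its own
# frame count before the batch reduction is applied.
frame_averaged = [loss / frames for loss, frames in zip(utterance_losses, frame_counts)]
frame_averaged_mean = reduce_losses(frame_averaged, "mean")

print(batch_mean)
print(frame_averaged_mean)
```

Note that dividing by the frame count rescales every utterance by a different factor, so it changes how much long and short utterances contribute to the gradient, not just the scale of the reported number.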

There is no direct link between the RNN-T loss value and WER. I think a good analogy would be the negative log-likelihood and the accuracy of a classifier.
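
A small sketch of that analogy, using invented numbers: two hypothetical binary classifiers get exactly the same examples right, yet their negative log-likelihoods differ a lot. The probability lists and the `nll` and `accuracy` helpers are assumptions for illustration only.

```python
import math

# Hypothetical probabilities assigned to the correct class on four examples.
# Both classifiers get the third example wrong (probability below 0.5).
confident = [0.95, 0.90, 0.45, 0.92]
hesitant = [0.55, 0.60, 0.45, 0.51]


def nll(probs):
    """Mean negative log-likelihood of the correct class."""
    return -sum(math.log(p) for p in probs) / len(probs)


def accuracy(probs, threshold=0.5):
    """Fraction of examples where the correct class gets more than the threshold."""
    return sum(p > threshold for p in probs) / len(probs)


for name, probs in [("confident", confident), ("hesitant", hesitant)]:
    print(name, "accuracy:", accuracy(probs), "nll:", round(nll(probs), 3))
```

The same accuracy can correspond to quite different loss values, which is why a target WER cannot be translated into a specific RNN-T loss.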

1ytic, Sep 12 '21 19:09