temperature_scaling
How to use temperature scaling with CTC loss?
The current implementation assumes a single prediction per input, which is reasonable for many classification tasks. However, there is a class of problems where the output is a sequence of labels, as in speech recognition.
Suppose CTC loss is used for speech recognition, where the predicted sequence may differ in length from the ground-truth label sequence. How should ECE be computed properly in such cases?
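For context, here is a minimal sketch of the single-prediction setting as I understand it: logits are divided by a temperature before the softmax, and ECE is the bin-weighted gap between accuracy and confidence. The names `softmax`, `expected_calibration_error`, and `n_bins` are illustrative assumptions, not taken from the repository, and the equal-width binning here is only one common choice.

```python
import math

def softmax(logits, temperature=1.0):
    """Softmax of one logit vector after dividing by the temperature."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def expected_calibration_error(confidences, correct, n_bins=10):
    """ECE with equal-width bins over [0, 1] (assumed binning scheme).

    confidences: one top-class probability per input
    correct:     one boolean per input (prediction == label)
    """
    n = len(confidences)
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        idx = min(int(conf * n_bins), n_bins - 1)  # conf == 1.0 goes in the last bin
        bins[idx].append((conf, ok))
    ece = 0.0
    for members in bins:
        if not members:
            continue
        avg_conf = sum(c for c, _ in members) / len(members)
        accuracy = sum(1 for _, ok in members if ok) / len(members)
        ece += (len(members) / n) * abs(accuracy - avg_conf)
    return ece

# Toy example: three inputs, three classes, one label per input.
logits = [[2.0, 0.5, 0.1], [0.2, 1.5, 0.3], [1.0, 0.9, 0.8]]
labels = [0, 1, 2]
for temperature in (1.0, 2.0):
    probs = [softmax(row, temperature) for row in logits]
    confidences = [max(p) for p in probs]
    correct = [p.index(max(p)) == y for p, y in zip(probs, labels)]
    print(temperature, expected_calibration_error(confidences, correct))
```

Note that this needs exactly one confidence and one correctness flag per input. With CTC the decoded sequence and the reference can have different lengths, so it is unclear to me which pairs of (confidence, correct) should go into the bins.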