loss_dropper
Adopting loss truncation for ASR
Hi
Can this be adopted for an ASR task (trained with CE loss)?
Our technique is fairly generic, so it will likely apply to any setting that uses cross-entropy. However, the specific details may differ depending on exactly how the loss is computed.
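To make "how the loss is computed" concrete, here is a minimal sketch of one way truncation could be applied to per-utterance cross-entropy in an ASR batch. It is not the loss_dropper API: the function names (`utterance_ce`, `truncation_mask`, `truncated_mean_loss`) and the `drop_fraction` parameter are hypothetical, and the cutoff is taken from the current batch only, which is a simplification. It also assumes the loss is averaged over the tokens of each utterance before truncation, which may not match a given ASR setup.

```python
import math

def utterance_ce(token_probs):
    """Mean cross-entropy of one utterance.

    token_probs: the model's probability for each reference token
    (hypothetical input format, chosen to keep the sketch dependency-free).
    """
    return -sum(math.log(p) for p in token_probs) / len(token_probs)

def truncation_mask(losses, drop_fraction):
    """Return 0/1 weights that zero out the highest-loss utterances.

    Assumption: the cutoff is computed within the batch, dropping
    int(len(losses) * drop_fraction) utterances.
    """
    n_drop = int(len(losses) * drop_fraction)
    # Indices ordered from highest loss to lowest.
    order = sorted(range(len(losses)), key=lambda i: losses[i], reverse=True)
    dropped = set(order[:n_drop])
    return [0.0 if i in dropped else 1.0 for i in range(len(losses))]

def truncated_mean_loss(batch_token_probs, drop_fraction=0.25):
    """Average the per-utterance CE over the utterances that were kept."""
    losses = [utterance_ce(p) for p in batch_token_probs]
    mask = truncation_mask(losses, drop_fraction)
    kept = sum(mask)
    if kept == 0:
        return 0.0
    return sum(l * m for l, m in zip(losses, mask)) / kept

# Four utterances; the third has very low probability on its reference tokens.
batch = [[0.9, 0.8], [0.5, 0.5], [0.01, 0.02], [0.7, 0.9]]
losses = [utterance_ce(p) for p in batch]
mask = truncation_mask(losses, 0.25)
loss = truncated_mean_loss(batch, 0.25)
```

Averaging over the kept utterances rather than the full batch is a design choice in this sketch; the key point is that truncation needs a loss value per example (here, per utterance) rather than a single already-reduced scalar.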