Liger-Kernel
Adding ignore index support for divergence losses
🚀 The feature, motivation and pitch
We have implemented the KL divergence and JSD losses. Thanks to the community! This feature request is to add optional ignore-index support: the loss would need an extra index tensor input plus an argument for the ignore index, so that positions whose index equals the ignore index are skipped during the loss computation.
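To make the requested semantics concrete, here is a minimal framework-free sketch in plain Python, not the Liger-Kernel implementation. The function name `kl_div_with_ignore`, the parameter names, the default `ignore_index=-100` (borrowed from the common cross-entropy convention), and the choice to average over non-ignored rows only are all assumptions for illustration.

```python
import math


def kl_div_with_ignore(log_q, p, labels, ignore_index=-100):
    """Mean KL(P || Q) over rows whose label differs from ignore_index.

    log_q  : list of rows of log-probabilities (e.g. student predictions)
    p      : list of rows of target probabilities
    labels : one index per row; rows equal to ignore_index are skipped
    All names and the averaging rule are hypothetical, for illustration only.
    """
    total = 0.0
    n_kept = 0
    for log_q_row, p_row, label in zip(log_q, p, labels):
        if label == ignore_index:
            continue  # ignored row contributes nothing to the loss
        # Terms with p_i == 0 contribute 0 by the usual convention.
        total += sum(
            p_i * (math.log(p_i) - log_q_i)
            for p_i, log_q_i in zip(p_row, log_q_row)
            if p_i > 0.0
        )
        n_kept += 1
    # Assumption: normalize by the number of kept rows; return 0.0 if none.
    return total / n_kept if n_kept else 0.0


# Usage: the second row is masked out, so only the first row is scored.
uniform_log = [math.log(0.5), math.log(0.5)]
loss = kl_div_with_ignore(
    log_q=[uniform_log, uniform_log],
    p=[[0.5, 0.5], [0.9, 0.1]],
    labels=[1, -100],
)
```

Whether the reduction should divide by the number of non-ignored rows or by the full batch size is a design choice left open by the request; the sketch picks the former so that masked positions do not dilute the loss.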
Alternatives
No response
Additional context
No response