gluonts icon indicating copy to clipboard operation
gluonts copied to clipboard

Specifying Loss Distributions for Temporal Fusion Transform

Open esbraun opened this issue 2 years ago • 3 comments

Description

Currently the mxnet temporal fusion transformer only allows for quantile loss. It would be helpful to be able to specify other distributions (i.e. negative binomial for count data).

Would this change be possible?

References

The library pytorch_forecasting allows specification of the loss for its temporal fusion transformer implementation.

esbraun avatar Jul 14 '22 16:07 esbraun

note I have such a version for pytorch: https://github.com/kashif/pytorch-transformer-ts/tree/main/tft

kashif avatar Jul 14 '22 17:07 kashif

Thank you! Is this something that you're planning on merging eventually as well?

esbraun avatar Jul 20 '22 16:07 esbraun

yes eventually

kashif avatar Jul 20 '22 16:07 kashif

Update: I've empirically found significant improvement using the negative binomial distribution for a operations related count valued forecasting problem using the TFT implementation from kashif's repo over the mxnet TFT gluonts implementation and other gluonts algorithms. The implementation, however, lacks the full featured covariate support of the gluonts mxnet model. Hopefully the support for more loss functions could be wrapped into remediation for #2416.

esbraun avatar Jan 12 '23 20:01 esbraun