What does this PR do?

This PR is adding the distribution params and the name of the distribution to the SampleTSPredictionOutput. Additionally, it enables the distribution outputs in the autoformer, informer, and time series transformer.

Fixes #29556

Before submitting

[ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
[x] Did you read the contributor guideline, Pull Request section?
[x] Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case.
[x] Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
[x] Did you write any new necessary tests?

I extended existing tests

Who can review?

@kashif

Mar 16 '24 22:03 benHeid

thanks @benHeid having a look!

Mar 16 '24 22:03 kashif

@benHeid Can we confirm that the param output is not num_samples copies of the parameter?

Mar 17 '24 12:03 kashif

@benHeid Can we confirm that the param output is not num_samples copies of the parameter?

Sorry for the late answer. I was a bit blocked with various stuff the last weeks.

For the Informer, each time step a new param is created. This param is fed into the distribution object to create the next sample. Thus, I assume that this is definitely not the same for each future value.
For the TimeSeriesModel Transformer, the same argument as for the Informer could be applied.
For the autoformer, the params are the last prediction_length values of the parameter_projection (several neural networks). Thus, this should also not be a copy. If desired, I can add a test regarding that.

Furthermore, I just seen that we also need to report the loc and scale values from the normalization step. Here I am wondering, if it might be possible to adapt the parameters in a way that they are reflecting the scale and loc. However, I am unsure if it is that easy for each distribution.. Otherwise, we could just return these values too.

Apr 01 '24 08:04 benHeid

@benHeid yes so for gaussian and student-t the loc and scale can be used to get the parameters of the corresponding scaled distribution. However for neg. bin. i haven't figured out how to do that with the loc parameter... and so only the No-ops scaling and "mean" scaling can be used at the moment with neg. bin. If you can figure out how the loc param can be applied to the neg. bin params I would appreciate it.

Apr 01 '24 08:04 kashif

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

Apr 26 '24 08:04 github-actions[bot]

@kashif unfortunately, I haven't found a solution. Thus, I would propose that the scaling information are also passed to SampleTSPredictionOutput so that the user has the possibility to use them.

May 01 '24 10:05 benHeid

yes that would be ideal... and then one can either create the (distrb(scaled_params).sample() + loc) * scale or via a transformed distribution

May 01 '24 10:05 kashif

Exactly. I will check again if everything is working as expected locally and ping you again here.

May 01 '24 10:05 benHeid

@kashif, I added the scaling params. Thus, I suppose it is ready for review now :)

May 12 '24 07:05 benHeid

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

Jun 05 '24 08:06 github-actions[bot]

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Jun 14 '24 08:06 HuggingFaceDocBuilderDev

Regarding the failed tests. It seems that at least a significant part of the failure is not related to my changes or? E.g., I haven't changed any examples and also the linter is proposing file changes in files that I haven't touched.

Jul 02 '24 06:07 benHeid

@benHeid agree... let me check

Jul 02 '24 06:07 kashif

I applied fix-copies and pushed. Is this correct or should I revert this?

Jul 02 '24 12:07 benHeid

lets revert all the un-related changes... hopefully that also fixes the merge conflicts?

Jul 02 '24 12:07 kashif

I reverted. The only files that are getting change by fix-copies are: modeling_bigbird_pegasus.py, modeling_cohere.py, modeling_gemma.py, and test_tokenization_roc_bert.py. Since I haven't changed them I reverted these changes.

Jul 02 '24 13:07 benHeid

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

Jul 27 '24 08:07 github-actions[bot]

@kashif what is the current state ?

Aug 05 '24 08:08 benHeid

i believe the last thing was to get the API also working with patchTST

Aug 05 '24 08:08 kashif

transformers
transformers copied to clipboard

Add distribution params to time series output

What does this PR do?

Before submitting

Who can review?

transformers transformers copied to clipboard

Add distribution params to time series output

What does this PR do?

Before submitting

Who can review?

transformers
transformers copied to clipboard