[Core feature] Enable pyflyte backfill to ignore cache

Open • dennisobrien opened this issue 1 year ago • 1 comment

Motivation: Why do you think this is important?

When launching a workflow from the web UI, it is possible to select the advanced option "Overwrite cached outputs" so that the run ignores any cached values that may be available. This functionality is not available in the pyflyte backfill command.

Ideally, when running a backfill, it would be possible to ignore the cache for all executions in that backfill. Backfills are sometimes necessary when upstream data had a data quality issue that was later fixed, requiring downstream jobs to re-run on the restated data.

Goal: What should the final outcome look like, ideally?

The pyflyte backfill command should have a flag named something like --ignore-cache or --overwrite-cached-outputs that behaves similarly to selecting "Overwrite cached outputs" in the web UI.
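
To make the request concrete, here is a rough sketch of the behavior I have in mind. It is not flytekit code and does not reflect how pyflyte backfill is implemented; it uses only the Python standard library. The option names (--from-date, --to-date, --overwrite-cache), the plan_backfill helper, and the dictionary layout are all hypothetical and exist only for illustration. The point is that a single boolean flag would be applied to every execution generated for the backfill window.

```python
import argparse
from datetime import datetime, timedelta


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical CLI; the option names are assumptions, not pyflyte's actual flags.
    parser = argparse.ArgumentParser(prog="backfill-sketch")
    parser.add_argument("--from-date", type=datetime.fromisoformat, required=True)
    parser.add_argument("--to-date", type=datetime.fromisoformat, required=True)
    parser.add_argument(
        "--overwrite-cache",
        action="store_true",
        help="Ignore cached outputs for every execution in the backfill.",
    )
    return parser


def plan_backfill(start, end, overwrite_cache, step=timedelta(days=1)):
    # Build one execution spec per step in [start, end),
    # carrying the same overwrite_cache setting on each one.
    plan = []
    current = start
    while current < end:
        plan.append({"scheduled_at": current, "overwrite_cache": overwrite_cache})
        current += step
    return plan


def main(argv=None):
    args = build_parser().parse_args(argv)
    return plan_backfill(args.from_date, args.to_date, args.overwrite_cache)
```

For example, calling main(["--from-date", "2024-02-01", "--to-date", "2024-02-04", "--overwrite-cache"]) would plan one execution per day in the window, each marked to bypass the cache. Without the flag, the default stays False, so existing backfills would behave as they do today.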

Describe alternatives you've considered

I don't know of simple alternatives to running a backfill from the command line that has an option to overwrite the cache.

Propose: Link/Inline OR Additional context

No response

Are you sure this issue hasn't been raised already?

  • [X] Yes

Have you read the Code of Conduct?

  • [X] Yes

dennisobrien, Feb 19 '24 01:02

🤖

It seems like you've got this under control. If you want help or have specific questions, let me know what I can do for you!

dosubot[bot], Feb 19 '24 01:02