Provenance data collection
(From offline chat with @oliver-sanders)
For science experiments, we should provide proper provenance data collection: everything that goes into obtaining a result (workflow execution, system info, user interaction), captured in a standard format for scientific integrity purposes.
We already collect some of this information automatically, but users have to roll their own ways of scraping it from the workflow DB and logs, and have to add their own code to collect system info (e.g. on job hosts).
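To illustrate the kind of code users currently write themselves, here is a minimal sketch of collecting system info on a job host using only the Python standard library. This is not an existing Cylc feature or API: the function names and the record's field names are hypothetical, and the JSON layout is an ad hoc stand-in, not the standard format proposed above.

```python
import getpass
import json
import platform
import socket
import sys
from datetime import datetime, timezone


def _current_user():
    """Return the login name, or None if it cannot be determined."""
    try:
        return getpass.getuser()
    except Exception:
        # Some containers and batch environments have no passwd entry.
        return None


def collect_system_info():
    """Return a JSON-serialisable snapshot of the host running this code.

    The field names are illustrative only (hypothetical, not a Cylc schema).
    """
    return {
        "collected_at": datetime.now(timezone.utc).isoformat(),
        "hostname": socket.gethostname(),
        "user": _current_user(),
        "os": platform.system(),
        "os_release": platform.release(),
        "machine": platform.machine(),
        "python_version": platform.python_version(),
        "python_executable": sys.executable,
    }


if __name__ == "__main__":
    # A job script could write this next to its other job logs.
    print(json.dumps(collect_system_info(), indent=2, sort_keys=True))
```

A task script could run something like this at start-up and store the output alongside its logs. Built-in support would remove the need for every user to maintain their own version.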
See also https://github.com/cylc/cylc-flow/issues/3491