pudl
pudl copied to clipboard
TESTONLY - sharded ferc_to_sqlite in ci-integration
Splits off ferc_to_sqlite
step into sharded step that generates each dataset independently on a smaller runner. This way we should be able to speed up the process significantly.
Check out this pull request on
See visual diffs & provide feedback on Jupyter Notebooks.
Powered by ReviewNB
This is combination of pre-existing PR that reworks pytest and refactoring of ferc_to_sqlite
into separate matrix-based job. I'm aiming to turn this into separate PR when parallel-pytest
is merged, but this gives a preview into this proof of concept. I think that it would make sense to move the setup steps (micromamba + other steps) into reusable workflow that can be used here repeatedly now that we have more than just one instance that needs this.