cdx_toolkit
cdx_toolkit copied to clipboard
feat: Adding settings, utils, write to S3, and writer close
This PR integrates a couple of general changes from the EOT PR (https://github.com/cocrawler/cdx_toolkit/pull/54):
- Settings variables are loaded from environment variables in
settings.py - Common CLI methods are moved to
utils.py - Read and write to S3 (via fsspec).
- Missing
writer.close()statement is added to CLI and example.