openedgar
Dockerize
Hi, I thought I'd dockerize this project to make the setup less onerous.
$ dc up -d
# => should see all services start
$ dc run --rm shell
# > from openedgar.processes.edgar import download_filing_index_data, process_all_filing_index
# > download_filing_index_data(year=2018)
# => ... ('edgar/daily-index/2018/QTR4/form.20181231.idx', True, False)]
There are likely a few things I'm missing, so consider this pull request a WIP for the time being.
When I try the 2018 example, I get the following error in the celery process:
[2021-01-24 07:01:14,020: ERROR/MainProcess] Task openedgar.tasks.process_filing_index[2512eba5-5f2b-4a25-aebf-7f5ca8db26cb] raised unexpected: ConnectionError(MaxRetryError('None: Max retries exceeded with url: /rmeta/text (Caused by None)',),)
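The `/rmeta/text` path in that error looks like an Apache Tika Server endpoint, so one possible cause is that the celery container cannot reach the Tika service. A minimal sketch for checking that from the `dc run --rm shell` session follows. The helper `can_connect` is hypothetical (it is not part of openedgar), and the host name `tika` in the usage note is an assumption about how the compose service is named; 9998 is Tika Server's default port.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds.

    Hypothetical diagnostic helper, not part of openedgar.
    """
    try:
        # create_connection resolves the host name and opens a TCP socket.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers DNS failures, refused connections, and timeouts.
        return False
```

For example, `can_connect("tika", 9998)` returning `False` would suggest the service name or port in the compose file does not match what the openedgar settings expect, though it would not rule out other causes.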