crs-reports-website
The build process for EveryCRSReport.com.
The following links give an access denied error:
files/20171113_IN10814_da39a3ee5e6b4b0d3255bfef95601890afd80709.pdf, files/20171113_IN10814_483dd4b1c3bedd3a8fb82fd19a6a761fb9e0ba8e.html
files/20171113_R45017_da39a3ee5e6b4b0d3255bfef95601890afd80709.pdf, files/20171113_R45017_483dd4b1c3bedd3a8fb82fd19a6a761fb9e0ba8e.html
files/20171113_IN10816_483dd4b1c3bedd3a8fb82fd19a6a761fb9e0ba8e.html
files/20171113_R44552_da39a3ee5e6b4b0d3255bfef95601890afd80709.pdf, files/20171113_R44552_483dd4b1c3bedd3a8fb82fd19a6a761fb9e0ba8e.html
files/20171109_R43407_da39a3ee5e6b4b0d3255bfef95601890afd80709.pdf, files/20171109_R43407_483dd4b1c3bedd3a8fb82fd19a6a761fb9e0ba8e.html
files/20171109_R45015_da39a3ee5e6b4b0d3255bfef95601890afd80709.pdf, files/20171109_R45015_483dd4b1c3bedd3a8fb82fd19a6a761fb9e0ba8e.html
files/20171108_R44966_da39a3ee5e6b4b0d3255bfef95601890afd80709.pdf, files/20171108_R44966_483dd4b1c3bedd3a8fb82fd19a6a761fb9e0ba8e.html
files/20171025_R44992_da39a3ee5e6b4b0d3255bfef95601890afd80709.pdf, files/20171025_R44992_483dd4b1c3bedd3a8fb82fd19a6a761fb9e0ba8e.html
files/20080729_RL34601_da39a3ee5e6b4b0d3255bfef95601890afd80709.pdf, files/20080729_RL34601_61dc708afa779848195d4f82829ecc7482cedd3e.html
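One detail in the paths above may help with triage: the hash segment in every affected PDF name, `da39a3ee5e6b4b0d3255bfef95601890afd80709`, is the SHA-1 digest of empty input. If the hash in each filename is a SHA-1 of the file's contents (an assumption; the report does not say how the names are built), those PDFs may have been stored as zero-byte files. A minimal sketch for flagging such paths follows; the filename pattern and the `empty_hash_paths` helper are hypothetical and inferred only from the names listed here.

```python
import hashlib
import re

# SHA-1 of zero bytes, computed rather than hard-coded.
EMPTY_SHA1 = hashlib.sha1(b"").hexdigest()

# Assumed filename layout, inferred from the paths in the report:
#   files/<YYYYMMDD>_<report number>_<40 hex digits>.<pdf|html>
NAME_RE = re.compile(r"files/(\d{8})_([A-Z]+\d+)_([0-9a-f]{40})\.(pdf|html)$")


def empty_hash_paths(paths):
    """Return the paths whose hash segment equals the SHA-1 of empty input."""
    flagged = []
    for path in paths:
        match = NAME_RE.search(path)
        if match and match.group(3) == EMPTY_SHA1:
            flagged.append(path)
    return flagged


sample = [
    "files/20171113_IN10814_da39a3ee5e6b4b0d3255bfef95601890afd80709.pdf",
    "files/20171113_IN10814_483dd4b1c3bedd3a8fb82fd19a6a761fb9e0ba8e.html",
]
print(empty_hash_paths(sample))
```

If the assumption holds, only the PDF in `sample` is flagged, which would point at the step that writes the PDFs rather than at the access rules.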
I am trying to download all of the reports. The requests hang indefinitely on the very first URL fed to my script: ```python import csv import requests base_url = 'https://everycrsreport.com'...
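A likely contributor to a hang like this is a request made without a timeout: `requests.get` waits indefinitely unless a `timeout` argument is passed. The sketch below shows the same idea with the standard library's `urllib.request` in place of `requests`, so it runs without extra packages. The helper names `report_url` and `fetch`, the retry count, and the 30-second timeout are illustrative assumptions, not part of the original script.

```python
import urllib.request
from urllib.parse import urljoin

# Site root taken from the base_url in the script above.
BASE_URL = "https://everycrsreport.com/"


def report_url(relative_path, base_url=BASE_URL):
    """Join a relative path such as 'files/....pdf' onto the site root."""
    return urljoin(base_url, relative_path.lstrip("/"))


def fetch(url, timeout=30.0, attempts=3):
    """Return the response body as bytes, or None if every attempt fails.

    The timeout applies to each blocking socket operation, so a stalled
    connection raises an error instead of hanging forever.
    """
    for attempt in range(1, attempts + 1):
        try:
            with urllib.request.urlopen(url, timeout=timeout) as response:
                return response.read()
        except OSError as exc:  # covers URLError, HTTPError, and socket timeouts
            print(f"attempt {attempt}/{attempts} failed for {url}: {exc}")
    return None


if __name__ == "__main__":
    # Hypothetical usage; replace the path with one read from the CSV listing.
    body = fetch(report_url("files/example.pdf"))
    print("downloaded" if body is not None else "gave up")
```

With `requests`, the equivalent change would be passing `timeout=30` to `requests.get`, which makes a stalled download fail fast so the loop can log the URL and move on.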
This is regarding https://github.com/JoshData/crs-reports-website/blob/ae193cee52e57860ed7e50fbeca3cc5fed82a693/process_incoming.py#L450. The good news is that there are only three unique exceptions across all the PDFs (52 problem files in total). See below for representative tracebacks. These...
As @DanielSchuman wrote in #3, > it also may be useful to have the citations to the US code link to the US Code (using the pre-built parser) or the...
✨ Added some new functionality for ePub! This commit aims at improving the ePub experience for CRS Reports. It adds a new dependency (PyMuPDF) to handle PDF parsing and generates...
With all of the buzz that ChatGPT is getting these days, many people may not realize there is an implementation designed to enable ChatGPT-like interaction with a GPT model trained...
I think this is trivial, but mentioning it anyway. Some CRS reports contain boxes with text. Yes, I don't know why. When they're rendered, the font size is understandably different....
This is probably too hard to accomplish. However, there appear to be a number of unusual line breaks. It could be that the rendering tool is misreading periods as a...
In some of the EPUB versions, the title for Figure 1 and the image accompanying Figure 1 are separated by text, whereas in the PDF version they are together. Examples...
It appears that the transformation from PDF to EPUB has difficulty handling the tables. Perhaps they should be treated as images -- where they are turned into a full-screen...