PatentChem
PatentChem copied to clipboard
urllib.error.HTTPError: HTTP Error 404: Not Found
I have downloaded the files and created a new environment using the provided "environment.yml" file successfully.
However, I got an error message when running the following in the terminal: python download.py --years 2023 --data_dir .
The error message is below:
(patents) C:\Users\alant>python download.py --years 2023 --data_dir .
Preparing to download all USPTO patents from 2023 ...
Found 18 releases from 2023
Directory for 2023 already exists.
Directory for 2023\I20230103 already exists.
2023\I20230103.tar: 0.00B [00:07, ?B/s]
Traceback (most recent call last):
File "C:\Users\alant\download.py", line 160, in
The URL that is in the code for download.py is correct and can be accessed through the browser, so I am confused as to why this error message was raised. This is the URL: https://bulkdata.uspto.gov/data/patent/grant/redbook/2023/. I got the same error message when running the code for other years.
Thanks very much for the help in troubleshooting this issue!