uspto-opendata-python icon indicating copy to clipboard operation
uspto-opendata-python copied to clipboard

Outdated dependencies

Open rahul-gj opened this issue 5 years ago • 4 comments

Can we update the dependencies?

It is un-necessarily uninstalling the updated packages and installing old packages.

  Found existing installation: urllib3 1.24.1
    Uninstalling urllib3-1.24.1:
      Successfully uninstalled urllib3-1.24.1
  Found existing installation: idna 2.8
    Uninstalling idna-2.8:
      Successfully uninstalled idna-2.8
  Found existing installation: requests 2.21.0
    Uninstalling requests-2.21.0:
      Successfully uninstalled requests-2.21.0
  Found existing installation: lxml 4.3.1
    Uninstalling lxml-4.3.1:
      Successfully uninstalled lxml-4.3.1
  Found existing installation: beautifulsoup4 4.7.1
    Uninstalling beautifulsoup4-4.7.1:
      Successfully uninstalled beautifulsoup4-4.7.1
beautifulsoup4-4.6.0,  lxml-4.2.5 requests-2.18.4 urllib3-1.22

rahul-gj avatar Mar 11 '19 10:03 rahul-gj

Dear Rahul,

sure we can. May I humbly ask whether this library is still of any use for the community? As we recognized by #5 or the like, some of the services offered by USPTO might have gone defunct in the meanwhile?

With kind regards, Andreas.

amotl avatar Mar 11 '19 12:03 amotl

The ped system is working fine. I am using it. but it is taking a lot of time for doing a single request.

from uspto.peds.client import UsptoPatentExaminationDataSystemClient
client = UsptoPatentExaminationDataSystemClient()
req_json = client.download_document(
    type='patent',
    number='9434717',
    format="json"
)

#and lot of stuff from obtained data.

This code takes too much time to complete. I don't know why. on https://ped.uspto.gov/peds/ when I enter 9434717 it opens in a fraction of second. but again request for packaging takes time. but when I enter '9434717, 9326936, 8815838' in the Patent Number field and it gives me 3 result. requesting a package gives me zip with year wise jsons in it and it takes same time as single download.

so I tried similar way with this library.

req_json = client.download_document(
    type='patent',
    number='9434717, 9326936, 8815838',
    format="json"
)

gave an error. so how can I download document for multiple patent number at once in synchronous mode? or you can give me peds api smple url and I will try to help here. I have tried reading official docs but my current knowledge is not sufficient to understand it.

Thanks.

rahul-gj avatar Mar 13 '19 04:03 rahul-gj

Dear Rahul,

thanks for letting me know that the ped system still works and that you are using it. As far as I recognized back then when implementing this library, the download process was asynchronous / bulk mode only somehow.

At least, I tried to focus on and automate the "Request Download" path, because

The Patent Examination Data system (PEDs) shows the first 20 results in the dataset. To see more results, click the "Request Download" link.

If you are into fetching single patents only, the download process is pretty lengthy, while requesting single documents and viewing their bibliographic data is more quick, as you already recognized:

on https://ped.uspto.gov/peds/ when I enter 9434717 it opens in a fraction of second.

Thanks for your insights! While I can't promise a timeframe, I recognize the ad hoc way of acquisition as a valuable alternative and will try to implement it.

May I humbly ask you to open another issue for that à la "Synchronously download documents for multiple patent numbers" to stay clean on the bookkeeping side? Then, we can start to investigate further there. Thanks in advance!

With kind regards, Andreas.

amotl avatar Mar 13 '19 10:03 amotl

May I humbly ask you to open another issue for that à la "Synchronously download documents for multiple patent numbers" to stay clean on the bookkeeping side?

Thanks already, we are now tracking this at #7.

amotl avatar Mar 13 '19 21:03 amotl