uspto-patent-data-parser

A Python tool for reading, parsing, and finding patents using the United States Patent and Trademark Office (USPTO) Bulk Data Storage System.

Results 8 uspto-patent-data-parser issues

When looking for claim data, only the first line of claim data is ingested. Claims can contain many lines of text. An example:
```xml
1. An imaging lens system including,...
```
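
One way to capture multi-line claims is to join every text node beneath a claim element instead of reading only `element.text`. The following is a minimal sketch using the standard library; the sample XML, the `claim` and `claim-text` tag names, and the helper `claim_full_text` are assumptions made for illustration, not the parser's actual code.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: a claim whose text is split across nested elements.
SAMPLE = """
<claims>
  <claim id="CLM-00001" num="00001">
    <claim-text>1. A widget comprising:
      <claim-text>a first part; and</claim-text>
      <claim-text>a second part attached to the first part.</claim-text>
    </claim-text>
  </claim>
</claims>
"""


def claim_full_text(claim_element):
    """Return all text under a claim element, whitespace-normalized."""
    # itertext() walks every descendant, so nested text is not dropped
    # the way it is when reading only `claim_element.text`.
    return " ".join("".join(claim_element.itertext()).split())


root = ET.fromstring(SAMPLE)
# iter("claim") matches only <claim>, not the nested <claim-text> elements.
claims = [claim_full_text(c) for c in root.iter("claim")]
print(claims[0])
```

Collapsing whitespace keeps each claim on one line; if the original line breaks matter, the `split()`/`join()` step could be dropped.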

When parsing the bibliographical information, we just insert the keys
```python
invention_title = root_tree.find(invention_title_path)
document_data = {}
if publication_info != None:
    publication_reference_info = {element.tag: element.text for element in list(publication_info)}
    document_data...
```

Moreover, I suggest this function be changed like this, because I ran into an encoding problem:
```
def read_and_parse_txt_from_disk(path_to_file, data_items):
    try:
        with open(path_to_file, 'r', encoding='utf-8') as f:
            txt = f.read()
    except:
        with open(path_to_file, 'r', encoding='latin1')...
```
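
The same fallback idea can be written so that only decoding failures trigger the second attempt, rather than a bare `except`. This is a sketch under stated assumptions: `read_text_with_fallback` is a hypothetical helper, not a function from the repository, and the encoding order simply mirrors the suggestion above.

```python
import os
import tempfile


def read_text_with_fallback(path_to_file, encodings=("utf-8", "latin-1")):
    """Return the file's text, trying each encoding in order."""
    if not encodings:
        raise ValueError("at least one encoding is required")
    last_error = None
    for encoding in encodings:
        try:
            with open(path_to_file, "r", encoding=encoding) as f:
                return f.read()
        except UnicodeDecodeError as error:
            # Only decoding failures trigger the fallback; other errors
            # (missing file, permissions) propagate to the caller.
            last_error = error
    raise last_error


# Demo: bytes that are valid Latin-1 but not valid UTF-8.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write("café".encode("latin-1"))
text = read_text_with_fallback(tmp.name)
os.remove(tmp.name)
print(text)
```

Latin-1 maps every byte to a character, so it never raises a decoding error. It works as a last resort, but it can silently produce wrong characters if the file is actually in some other encoding.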

Hello coder, when I try to parse the data from 1998, there is an error: the function `get_patents_list` returns an empty list. If I change the code to this:...

BRFSUM elements are present in the current (xml4) USPTO format, but `read_and_parse_xml4_from_disk` has no logic to extract them.
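
A rough sketch of how the summary might be pulled out of the raw XML text follows. It assumes that BRFSUM appears as a pair of processing instructions marked `end="lead"` and `end="tail"` around the summary paragraphs; that layout should be checked against real bulk files. The sample document and the helper `extract_brief_summary` are hypothetical and not part of the parser.

```python
import re

# Assumption: the summary is bracketed by paired processing instructions,
# e.g. <?BRFSUM ... end="lead"?> summary paragraphs <?BRFSUM ... end="tail"?>.
BRFSUM_PATTERN = re.compile(
    r'<\?BRFSUM\b[^>]*?end="lead"\s*\?>(.*?)<\?BRFSUM\b[^>]*?end="tail"\s*\?>',
    re.DOTALL,
)
TAG_PATTERN = re.compile(r"<[^>]+>")

# Hypothetical sample, not taken from a real patent file.
SAMPLE = """<description>
<?BRFSUM description="Brief Summary" end="lead"?>
<heading level="1">BACKGROUND</heading>
<p id="p-0002" num="0001">A widget is described.</p>
<?BRFSUM description="Brief Summary" end="tail"?>
<?DETDESC description="Detailed Description" end="lead"?>
<p id="p-0003" num="0002">Details.</p>
</description>"""


def extract_brief_summary(xml_text):
    """Return the plain text between the BRFSUM markers, or None if absent."""
    match = BRFSUM_PATTERN.search(xml_text)
    if match is None:
        return None
    # Replace markup with spaces, then collapse runs of whitespace.
    without_tags = TAG_PATTERN.sub(" ", match.group(1))
    return " ".join(without_tags.split())


summary = extract_brief_summary(SAMPLE)
print(summary)
```

Working on the raw text avoids depending on how an XML parser treats processing instructions. The sketch does not unescape character entities, so `html.unescape` or a proper parser may still be needed on real data.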

I noticed the parser returned the `doc-number` rather than `patent number` for the patents. Although one can search a patent using `doc-number`, I cannot find a mapping for `doc-number` vs....

When USPTO updates its data, do they add new files, or is it possible for them to edit existing ones?

Bugfix for ipcr classification