webstruct
webstruct copied to clipboard
NER toolkit for HTML data
Results
23
webstruct issues
Sort by
recently updated
recently updated
newest added
Traceback (after trying to `NER.annotate()` https://github.com/scrapinghub/webstruct/blob/master/webstruct_data/corpus/business_pages/source/301.html page): ``` ValueError Traceback (most recent call last) in () 9 try: 10 with open(fn, 'rb') as f: ---> 11 annotated = ner.annotate(f.read()) 12...
i.e. ``` python parent = elem.getparent() pos = parent.index(elem) ``` this should capture the case where first element is the subject and second is the content
enhancement
Bumps the pip group in /example with 1 update: [tqdm](https://github.com/tqdm/tqdm). Updates `tqdm` from 4.15.0 to 4.66.3 Release notes Sourced from tqdm's releases. tqdm v4.66.3 stable cli: eval safety (fixes CVE-2024-34062,...
dependencies