webstruct

NER toolkit for HTML data

Results 23 webstruct issues

Traceback (after trying to `NER.annotate()` the https://github.com/scrapinghub/webstruct/blob/master/webstruct_data/corpus/business_pages/source/301.html page):

```
ValueError Traceback (most recent call last)
in ()
      9 try:
     10     with open(fn, 'rb') as f:
---> 11         annotated = ner.annotate(f.read())
     12...
```

i.e.

```python
parent = elem.getparent()
pos = parent.index(elem)
```

This should capture the case where the first element is the subject and the second is the content.
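To make the idea concrete, here is a minimal sketch of finding an element's position within its parent and pairing a "subject" element with the sibling that follows it. It is an illustration under assumptions, not webstruct's code: the markup, the `<b>` tag chosen as the subject marker, and the helper names `position_in_parent` and `subject_content_pairs` are all hypothetical. It also uses the standard library's `xml.etree.ElementTree` rather than lxml; ElementTree has no `getparent()`, so the sketch builds a child-to-parent map to stand in for it.

```python
import xml.etree.ElementTree as ET

# Hypothetical markup: each <b> subject is followed by a <span> with its content.
SAMPLE = (
    "<div>"
    "<b>Phone:</b><span>555-0100</span>"
    "<b>City:</b><span>Cork</span>"
    "</div>"
)


def build_parent_map(root):
    """Map every child element to its parent (stand-in for lxml's getparent())."""
    return {child: parent for parent in root.iter() for child in parent}


def position_in_parent(elem, parent_map):
    """Return (parent, pos), where pos is elem's index among the parent's children."""
    parent = parent_map[elem]
    return parent, list(parent).index(elem)


def subject_content_pairs(root, subject_tag="b"):
    """Pair each subject element's text with the text of the next sibling."""
    parent_map = build_parent_map(root)
    pairs = []
    for elem in root.iter(subject_tag):
        if elem not in parent_map:
            continue  # the root itself has no parent
        parent, pos = position_in_parent(elem, parent_map)
        siblings = list(parent)
        if pos + 1 < len(siblings):  # skip a subject that is the last child
            pairs.append((elem.text, siblings[pos + 1].text))
    return pairs


doc = ET.fromstring(SAMPLE)
pairs = subject_content_pairs(doc)
```

With lxml, the two lines from the snippet above (`elem.getparent()` and `parent.index(elem)`) would replace the parent map and `list(parent).index(elem)`.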

enhancement

Bumps the pip group in /example with 1 update: [tqdm](https://github.com/tqdm/tqdm).

Updates `tqdm` from 4.15.0 to 4.66.3.

Release notes (sourced from tqdm's releases): tqdm v4.66.3 stable. cli: eval safety (fixes CVE-2024-34062,...

dependencies