webstruct

NER toolkit for HTML data

23 webstruct issues

Is it possible to parse dates from a webpage?

It seems like the WebAnnotator extension is no longer available: https://addons.mozilla.org/en-US/firefox/addon/webannotator/ If it's not available, what are the other recommended options?

Webstruct looks like a really cool extension to have for any scraping enthusiast, so thank you for creating this! It would be really awesome if you guys could also release...

Sometimes the training data may be plain text. Instead of using python-crfsuite or any other CRF package, I still prefer to use webstruct because it has sklearn `pipeline` and some evaluation...

Allow assigning BIO tags with fuzzy matching. This can be useful when a dataset is only partially labelled.
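
The fuzzy-matching idea above could be sketched roughly as follows. This is not webstruct's API: `fuzzy_bio_tags`, its parameters, and the `0.8` threshold are hypothetical names and values chosen for illustration, and the similarity measure is the standard library's `difflib.SequenceMatcher`, not anything the issue prescribes.

```python
from difflib import SequenceMatcher


def fuzzy_bio_tags(tokens, entity_tokens, label, threshold=0.8):
    """Tag the best fuzzy-matching window of `tokens` with B-/I- tags.

    Hypothetical helper: slides a window the size of the entity over the
    token list and keeps the most similar window if it clears `threshold`.
    """
    tags = ["O"] * len(tokens)
    n = len(entity_tokens)
    if n == 0 or n > len(tokens):
        return tags

    target = " ".join(entity_tokens).lower()
    best_score, best_start = 0.0, None
    for start in range(len(tokens) - n + 1):
        window = " ".join(tokens[start:start + n]).lower()
        score = SequenceMatcher(None, window, target).ratio()
        if score > best_score:
            best_score, best_start = score, start

    # Only label the window when it is similar enough to the known entity.
    if best_start is not None and best_score >= threshold:
        tags[best_start] = "B-" + label
        for i in range(best_start + 1, best_start + n):
            tags[i] = "I-" + label
    return tags


tokens = "Contact Acme Corp. in Springfeld today".split()
print(fuzzy_bio_tags(tokens, ["Acme", "Corp"], "ORG"))
# ['O', 'B-ORG', 'I-ORG', 'O', 'O', 'O']
```

Here the annotation `Acme Corp` still matches the page text `Acme Corp.` despite the trailing dot. A lower threshold tolerates noisier annotations at the cost of more false matches.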

Tokenizer from #15 had issues like not splitting a dot at the end of a sentence as a separate token:
```
40006,40007c40017
< community
< .
---
> community.
41148,41149c41158...
```
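
For illustration only, a minimal regex tokenizer that always emits punctuation as its own token might look like the sketch below. It is not the tokenizer from #15 or the one webstruct ships. `TOKEN_RE` and `tokenize` are hypothetical names, and the pattern deliberately ignores abbreviations such as `e.g.`, which it would also split.

```python
import re

# Words (optionally joined by a hyphen or apostrophe) or any single
# non-space, non-word character such as a sentence-final dot.
TOKEN_RE = re.compile(r"\w+(?:[-']\w+)*|[^\w\s]")


def tokenize(text):
    """Split text into word and punctuation tokens (illustrative only)."""
    return TOKEN_RE.findall(text)


print(tokenize("Join our community."))
# ['Join', 'our', 'community', '.']
```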

I added the features I created for Fireflax

To do:
- [x] Make Wapiti BILOU compatible
- [x] Add test for Wapiti with BILOU on
- [x] Add test for wapiti.merge_top_n
- [x] Add tests for grouping.py
-...
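
As background for the BILOU items above, the following sketch shows how a BIO tag sequence maps onto BILOU (Begin, Inside, Last, Outside, Unit). It is a generic illustration and assumes well-formed `B-`/`I-` tags. `bio_to_bilou` is a hypothetical helper, not a function from webstruct or Wapiti.

```python
def bio_to_bilou(tags):
    """Convert a well-formed BIO tag sequence to BILOU (illustrative only)."""
    bilou = []
    for i, tag in enumerate(tags):
        if tag == "O":
            bilou.append(tag)
            continue
        prefix, label = tag.split("-", 1)
        next_tag = tags[i + 1] if i + 1 < len(tags) else "O"
        # The entity continues only if the next tag is I- with the same label.
        continues = next_tag == "I-" + label
        if prefix == "B":
            bilou.append(("B-" if continues else "U-") + label)
        else:  # prefix == "I"
            bilou.append(("I-" if continues else "L-") + label)
    return bilou


print(bio_to_bilou(["O", "B-ORG", "I-ORG", "I-ORG", "O", "B-PER"]))
# ['O', 'B-ORG', 'I-ORG', 'L-ORG', 'O', 'U-PER']
```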