data_extractor issues

build(deps): bump lxml from 4.8.0 to 4.9.1

1

Bumps [lxml](https://github.com/lxml/lxml) from 4.8.0 to 4.9.1. Changelog (sourced from lxml's changelog): 4.9.1 (2022-07-01). Bugs fixed: A crash was resolved when using iterwalk() (or canonicalize()) after parsing certain incorrect input. Note...

dependabot[bot]

dependencies

how to remove or deactivate the super class sub-extractors properly

#68 shows a bad example of how to remove the super class sub-extractors. But what's the right thing to do?

linw1995

enhancement

help wanted

Add elementpath for XPath2.0 support.

Add [elementpath](https://github.com/sissaschool/elementpath) for XPath2.0 support. Make it an optional dependency.
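One common way to keep a dependency optional is a guarded import with a fallback. The sketch below is only an assumption about how that could look, not the project's implementation: `select_nodes` is a hypothetical helper, and the standard library's `xml.etree.ElementTree` stands in for whatever engine is used when `elementpath` is not installed.

```python
import xml.etree.ElementTree as ET

try:
    # Optional dependency: only used when it is installed.
    import elementpath
except ImportError:
    elementpath = None


def select_nodes(root, expr):
    """Hypothetical helper: evaluate `expr` against `root`.

    Uses elementpath (XPath 2.0 by default) when available and falls
    back to ElementTree's limited XPath subset otherwise.
    """
    if elementpath is not None:
        return elementpath.select(root, expr)
    return root.findall(expr)


root = ET.fromstring("<root><item>a</item><item>b</item></root>")
items = select_nodes(root, "./item")
```

The fallback only understands the small XPath subset that ElementTree supports, so expressions that rely on XPath 2.0 features would still need `elementpath` installed.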

linw1995

enhancement

Better exception message in Extractor.

linw1995

enhancement

Add Chinese documentation

linw1995

docs

Iterative extracting.

- `python-json-rw` is not designed for iterable extracting
- `lxml.xpath` is not designed for iterable extracting

But implementing `AbstractSimpleExtractor.iter_extract` and `AbstractComplexExtractor.iter_extract` could reduce memory usage when...
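To illustrate the general idea of an `iter_extract` method, here is a minimal sketch using a generator. It is not the library's API: `TagTextExtractor` is a hypothetical class, and the standard library's `xml.etree.ElementTree.iterparse` is swapped in as the streaming parser so the example is self-contained.

```python
import io
import xml.etree.ElementTree as ET
from typing import Iterator, List


class TagTextExtractor:
    """Hypothetical extractor, not part of data_extractor."""

    def __init__(self, tag: str) -> None:
        self.tag = tag

    def iter_extract(self, source) -> Iterator[str]:
        # Yield each match as soon as its element is fully parsed,
        # instead of collecting every result into a list first.
        for _event, elem in ET.iterparse(source, events=("end",)):
            if elem.tag == self.tag:
                yield elem.text or ""
                # Drop the element's content once it has been consumed.
                elem.clear()

    def extract(self, source) -> List[str]:
        # The eager variant is just the lazy one, materialized.
        return list(self.iter_extract(source))


source = io.BytesIO(b"<root><item>a</item><item>b</item></root>")
results = TagTextExtractor("item").iter_extract(source)
first = next(results)
rest = list(results)
```

Because `iter_extract` is a generator, a caller that stops early never pays for the remaining matches, while `extract` keeps the familiar list-returning behavior.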

linw1995

enhancement

build(deps): bump urllib3 from 2.0.4 to 2.0.7

1

Bumps [urllib3](https://github.com/urllib3/urllib3) from 2.0.4 to 2.0.7. Release notes (sourced from urllib3's releases): 2.0.7. Made body stripped from HTTP requests changing the request method to GET after HTTP 303 "See Other"...

dependabot[bot]

dependencies
