docling icon indicating copy to clipboard operation
docling copied to clipboard

feat: Add DoclingParseV3 backend using high-level docling-parse API

Open cau-git opened this issue 10 months ago • 1 comments

Note: This is experimental and will change throughout future iterations.

Checklist:

  • [ ] Documentation has been updated, if necessary.
  • [ ] Examples have been added, if necessary.
  • [ ] Tests have been added, if necessary.

cau-git avatar Feb 06 '25 19:02 cau-git

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • [X] title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:

🟢 Require two reviewer for test updates

Wonderful, this rule succeeded.

When test data is updated, we require two reviewers

  • [X] #approved-reviews-by >= 2

mergify[bot] avatar Feb 06 '25 19:02 mergify[bot]

TODO

  • [x] Put DoclingParseV1DocumentBackend back, mark as deprecated
  • [x] Correct handling of BoundingRectangle.to_bounding_box() when text cells are rotated, instead of fixing it in get_text_cells.
  • [x] Add pipeline options to control if we want to populate the page.parsed_page in ConversionResult.

cau-git avatar Mar 12 '25 12:03 cau-git

LGTM!

maxmnemonic avatar Mar 18 '25 09:03 maxmnemonic