docling icon indicating copy to clipboard operation
docling copied to clipboard

fix: Introduce Image format options in CLI. Silence the tqdm downloading messages.

Open nikos-livathinos opened this issue 11 months ago • 1 comments
trafficstars

This is a fix to:

  • Introduce format options in the docling CLI for Image with the same pipeline_options as for PDF. This allows the CLI parameters to be applied in case of image inputs.
  • Add RapidOcrOptions to the Union of ocr_options for PdfPipelineOptions.
  • Silence the tqdm messages during the downloading of model files.

Issues resolved by this Pull Request: Resolves #505 #208

Checklist:

  • [x] Documentation has been updated, if necessary.
  • [x] Examples have been added, if necessary.
  • [x] Tests have been added, if necessary.

nikos-livathinos avatar Dec 08 '24 18:12 nikos-livathinos

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • [X] title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:

mergify[bot] avatar Dec 08 '24 18:12 mergify[bot]

Maybe one more check to do is, if the input format is an image, OCR must be activated for its pipeline options (independent of the global OCR choice)

cau-git avatar Dec 09 '24 14:12 cau-git