docling
docling copied to clipboard
fix: Introduce Image format options in CLI. Silence the tqdm downloading messages.
trafficstars
This is a fix to:
- Introduce format options in the docling CLI for Image with the same
pipeline_optionsas for PDF. This allows the CLI parameters to be applied in case of image inputs. - Add
RapidOcrOptionsto the Union ofocr_optionsforPdfPipelineOptions. - Silence the tqdm messages during the downloading of model files.
Issues resolved by this Pull Request: Resolves #505 #208
Checklist:
- [x] Documentation has been updated, if necessary.
- [x] Examples have been added, if necessary.
- [x] Tests have been added, if necessary.
Merge Protections
Your pull request matches the following merge protections and will not be merged until they are valid.
🟢 Enforce conventional commit
Wonderful, this rule succeeded.
Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/
- [X]
title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:
Maybe one more check to do is, if the input format is an image, OCR must be activated for its pipeline options (independent of the global OCR choice)