docling
docling copied to clipboard
Make EasyOCR optional dependency
Requested Feature
It would be beneficial to make OCR models optional during installation, with EasyOCR remaining as the default option. In our case, we use TesseractOCR but are required to install EasyOCR since it's currently mandatory, even though we don't use it.
Here's a proposed installation approach:
-
All OCR models:
pip install docling[all] -
EasyOCR only (default installation):
pip install docling[easyocr] -
Specific OCR models:
pip install docling[tesseract] -
Base installation (no OCR models):
pip install docling
Alternatives
- Install Docling as is - This installs EasyOCR and its dependencies even when they're not needed.
- Install Docling without dependencies - This requires significant maintenance effort on our end to ensure version compatibility.