Zhengyu-Ju

Results 6 comments of Zhengyu-Ju

bug:(MaxRetryError("HTTPSConnectionPool(host='huggingface.co', port=443): Max retries exceeded with url: /api/models/ds4sd/docling-models/revision/v2.0.1 (Caused by NewConnectionError(': Failed to establish a new connection: [Errno 101] Network is unreachable'))"), '(Request ID: f7ff0951-b939-4902-bb59-0ef04176f1c2)') code source = "/home/ubuntu/xxxxxx" converter...
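
The error above says the machine cannot reach huggingface.co to fetch `ds4sd/docling-models`. One possible workaround is to point `pipeline_options.artifacts_path` (the option used in a later comment on this page) at a copy of the models that already exists on disk. Below is a minimal sketch of a pre-flight check for such a directory. The helper name `resolve_local_artifacts` and the example path are hypothetical, and the check only confirms the directory exists and is non-empty, not that it contains valid model files.

```python
from pathlib import Path
from typing import Optional


def resolve_local_artifacts(path: str) -> Optional[Path]:
    """Return the directory if it exists and is non-empty, otherwise None.

    Hypothetical helper: it does not validate the model files themselves.
    """
    candidate = Path(path).expanduser()
    if candidate.is_dir() and any(candidate.iterdir()):
        return candidate
    return None


# Usage sketch (the path is a placeholder, not a real location):
# local = resolve_local_artifacts("/path/to/docling-models")
# if local is not None:
#     pipeline_options.artifacts_path = str(local)
```

If the helper returns `None`, the models are not available locally and the conversion would presumably still try to download them, so the network problem has to be solved first.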

**question:** Is there a function that generates a folder containing an images folder and a Markdown file? **Code:** doc_converter = DocumentConverter( format_options={ InputFormat.PDF: PdfFormatOption( pipeline_options=pipeline_options, backend=PyPdfiumDocumentBackend ) } ) result...
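
To make the requested layout concrete, here is a minimal standard-library sketch that writes one folder holding an `images` subfolder and a Markdown file. It does not use any docling API: the helper `write_markdown_bundle`, the file name `document.md`, and the `images` directory name are all assumptions for illustration, and the Markdown text and image bytes are expected to come from whatever conversion step runs earlier.

```python
from pathlib import Path
from typing import Dict


def write_markdown_bundle(out_dir: str, markdown: str, images: Dict[str, bytes]) -> Path:
    """Write <out_dir>/document.md and <out_dir>/images/<name> for each image.

    Hypothetical helper; returns the path of the Markdown file.
    """
    root = Path(out_dir)
    image_dir = root / "images"
    image_dir.mkdir(parents=True, exist_ok=True)

    for name, data in images.items():
        # Keep only the final path component so files stay inside images/.
        (image_dir / Path(name).name).write_bytes(data)

    md_path = root / "document.md"
    md_path.write_text(markdown, encoding="utf-8")
    return md_path
```

The Markdown passed in should reference pictures with relative links such as `![figure](images/fig1.png)` so that the folder stays portable when moved.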

### example file
[test.docx](https://github.com/user-attachments/files/17838922/test.docx)

### code
source = '/home/ubuntu/test.docx'
pipeline_options = PdfPipelineOptions()
pipeline_options.do_ocr = False
pipeline_options.do_table_structure = True
pipeline_options.table_structure_options.do_cell_matching = True
pipeline_options.generate_picture_images = True
artifacts_path = "/tmp/pycharm_project_763/fileReader/docling-models"
pipeline_options.artifacts_path = artifacts_path...

I have already tried using pipeline_options to define a WordFormatOption, like this: DocumentConverter( format_options={ InputFormat.PDF: PdfFormatOption( pipeline_options=pipeline_options ), InputFormat.DOCX: WordFormatOption( pipeline_options=pipeline_options ), } ) but I still get no picture info.

```
from docling.datamodel.pipeline_options import PdfPipelineOptions
IMAGE_RESOLUTION_SCALE = 2.0
import logging
import time
from pathlib import Path
from docling_core.types.doc import ImageRefMode, PictureItem, TableItem
from docling.datamodel.base_models import FigureElement, InputFormat, Table
from docling.datamodel.pipeline_options...
```

![image](https://github.com/user-attachments/assets/229b5181-8816-46d2-bad1-25d1d2509ece)