Zhengyu-Ju
Zhengyu-Ju
### Question ... **question**: How do I use the downloaded model? **Code**: source = "https://arxiv.org/pdf/2408.09869" # document per local path or URL converter = DocumentConverter() result = converter.convert(source) **downloaded models**...
### question when i use WordFormatOption() function to define a converter,i found that i cannot get the picture info ,i see the code of class SimplePipeline,l found that there is...
**question:** do we have a function to generate a floder which contains images floder and markdown file **Code:** ```python doc_converter = DocumentConverter( format_options={ InputFormat.PDF: PdfFormatOption( pipeline_options=pipeline_options, backend=PyPdfiumDocumentBackend ) } )...
如何将pdf的图片不要进行ocr识别 ``` p2t = Pix2Text.from_config(total_configs=total_config, enable_formula=False, enable_table=True, device='cpu') start = time.time() out_md = p2t.recognize_pdf( img_fp, table_as_image=False, save_debug_res=f'./output-debug', ) ``` 如果可配置,在哪里配置