PyMuPDF
PyMuPDF is a high-performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
I got a PyMuPDF Pro key and tried to call `pymupdf4llm.mark_down(example.hwpx)`. I then got a runtime error: `RuntimeError: code=7: cannot find entry Contents/Contents/header.xml`. But this did not happen when I executed the mark_down function...
### Description of the bug `doc.set_toc` does not always set y-positions properly. In my example PDF, they are off by 90 pts. ### How to reproduce the bug [buggy_toc_positions.pdf](https://github.com/user-attachments/files/20021312/buggy_toc_positions.pdf)...
### Description of the bug In a `.docx` file, PyMuPDF Pro did not display certain purple graphical elements: With PyMuPDF Pro: With Google Docs or Mac's Pages app: ...
### Description of the bug Not sure if this is documented behavior, but couldn't find it. To recreate: 1. Take a PNG file and rename it with a PDF extension....
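The report above is cut off, so how PyMuPDF treats the renamed file is not shown here. As a minimal sketch that does not use PyMuPDF at all, the real type of such a file can be checked from its leading bytes rather than its extension. The helper names `sniff_type` and `sniff_file` are hypothetical, and only the PNG and PDF signatures are covered.

```python
# Standard file signatures ("magic bytes").
PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"
PDF_SIGNATURE = b"%PDF-"


def sniff_type(head: bytes) -> str:
    """Guess the file type from the first bytes of its content.

    Simplification: the PDF marker is only looked for at offset 0,
    although some readers tolerate leading junk before it.
    """
    if head.startswith(PNG_SIGNATURE):
        return "png"
    if head.startswith(PDF_SIGNATURE):
        return "pdf"
    return "unknown"


def sniff_file(path: str) -> str:
    """Read the first 8 bytes of a file and classify them."""
    with open(path, "rb") as fh:
        return sniff_type(fh.read(8))
```

A PNG renamed to `example.pdf` would still be reported as `"png"` by `sniff_file("example.pdf")`, because the check ignores the file name.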
### Description of the bug For some documents, PyMuPDF Pro splits the document into many more pages than if I open the document with Google Docs (or Mac Pages/LibreOffice). This...
PyMuPDF Pro 1.25.0: Image in `.doc` file unexpectedly overlaps with text when using `get_pixmap()`
### Description of the bug Sometimes, an embedded image inside a `.doc` file overlaps with the text when creating an image of the document using `get_pixmap()`, although in other software...
### Description of the bug I am using the 940b: https://www.irs.gov/pub/irs-pdf/f940b.pdf The PDF file has identical pages, and each page has this specific dropdown:  The choice_values variable is empty....
### Description of the bug Based on my research, the MediaBox defines the size of the PDF page. The CropBox defines the rectangle of the page displayed by PDF viewers. Pixmap displays the...
### Description of the bug Cached data from one PDF file can cause incorrect colors and shapes when generating a Pixmap of a page of a different PDF file....
Is there any way this tool can be installed in Docker using Docker Compose? It's a great tool, and I would be very happy if there is a way to use...