PyMuPDF
PyMuPDF copied to clipboard
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
### Description of the bug ## What happened? If one installs the latest version of PyMuPDF, and they initialize a `Pixmap` object using Python `bytes`, they will see a log...
### Description of the bug [img_test.pdf](https://github.com/pymupdf/PyMuPDF/files/13817811/img_test.pdf) The image converted through the page.get_pixmap() function has characters that were not originally present in the PDF. The source file has characters that appear...
1.23.19: Building against mupdf's C++/Python language bindings is still quite rough around the edges
### Description of the bug Hi! I finally managed to upgrade mupdf to 1.23.9 and pymupdf to 1.23.19. I have tried this for weeks on and off and ran into...
## Describe the bug I'm trying to *extract text from PDF documents*, to isolate individual words and create an indexing system. For most PDF files, pymupdf (version 1.23.5) does a...
Hi, I wonder what causes this error: MuPDF error: syntax error: unknown keyword: '4.48823e' Thanks!
### Description of the bug https://github.com/pymupdf/PyMuPDF/issues/1286 Hello, the same bug is introduced, I downgraded to 1.23.5 and there it works as expected ### How to reproduce the bug https://github.com/pymupdf/PyMuPDF/issues/1286 ###...
### Description of the bug I want to create a pixmap from an annot and adapt the resolution with the "dpi" argument as stated in the docs. However, this is...
Addresses 3414
### Description of the bug Hi there! Thanks for the excellent software for manipulating PDF files. I have encountered the same issue as #3375. I am using `pymupdf` to remove...
### Description of the bug Hello! The function docstring for `get_text_blocks` in `utilts.py` suggests the return values as follows: ``` Returns: A list of the blocks. Each item contains the...