PyMuPDF
PyMuPDF copied to clipboard
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
**Is your feature request related to a problem? Please describe.** I'd like to implement a functionality which processes multiple PDF files one by one. Some PDFs are "valid" while some...
### Description of the bug I am attempting to install PyMuPDF on a Docker container running on an Apple Silicon. The installation is successful, but it is impossible to import...
### Description of the bug The Dockerfile failed to build because it couldn’t find the make command. This is indicated by the error message /bin/sh: 1: make: not found. The...
### Description of the bug When the PDF created after deleting text by applying the add_redact_annot() function is opened in a web browser, the shapes are not displayed correctly. ###...
### Description of the bug When processing larger PDF files the page.get_pixmap() method significantly increases memory usage and does not release it properly after completion. It results in a high...
**Is your feature request related to a problem? Please describe.** No, just a suggestion to improve the documentation. **Describe the solution you'd like** The following pages describe the basics of...
### Description of the bug # Problem Installing PyMuPDF on linux/arm64 fails while install dependencies on linux/arm64 docker build. To make multiarch image, I've made a github workflow that builds...
### Description of the bug I want to remove all texts and only keep vector graphics (such as straight lines) in PDF, the code and result are shown below. However,...
**Is your feature request related to a problem? Please describe.** Currently, this library takes _forever_ to compile on Alpine Linux for Docker containers running on ARM. Please cross-compile the library...
### Description of the bug Pdf file transform to image have a black block [89b59dbfae5e4d1d92596418e9585a10.pdf](https://github.com/user-attachments/files/15984196/89b59dbfae5e4d1d92596418e9585a10.pdf) ### How to reproduce the bug def pdf2png_with_pymupdf(pdf_data: Union[bytes, str], matrix: int = 2): """转换图片."""...