PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Results 162 PyMuPDF issues

### Description of the bug We are trying to detect any type of widget and delete it. When we perform the operation `page.delete_widget(w)` it throws AttributeError: `'Annot' object has no...

bug
fix developed

### Description of the bug I have a script that takes a PDF document URL, iterates through all the pages, generates a pixmap for each page, and uses it to...

bug
example required

### Description of the bug Hello, First of all, thank you for all the work you've been putting into this project. Last November, I reported a minor memory leak issue...

upstream bug
fix developed

### Description of the bug Text removal from a PDF with PyMuPDF works well with python3.8 and PyMuPDF 1.20.2: Python bindings for the MuPDF 1.20.3 library, but when using python3.12 and...

upstream bug
fix developed

### Description of the bug `get_text()` extracts numbers in the Cash Flow table in this document as hexadecimal characters. Copy/paste from the page and `pdftotext` extract the correct text. ###...

upstream bug
fix developed

Relates to issue #3163. This updates the area of the rectangle based on the floating point. Instead of simply typecasting from float to int, this function now floors the...
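The distinction this pull request draws between typecasting and flooring can be illustrated with plain Python. This is a minimal sketch, not the code from the pull request: the helper name `compare_rounding` and the sample values are hypothetical, and what the actual function floors is cut off in the excerpt above.

```python
import math
from typing import List, Tuple


def compare_rounding(values: List[float]) -> List[Tuple[float, int, int]]:
    """Return (value, int(value), math.floor(value)) for each input.

    int() truncates toward zero, while math.floor() always rounds down,
    so the two differ for negative non-integer values.
    """
    return [(v, int(v), math.floor(v)) for v in values]


# Hypothetical sample coordinates, chosen to include negative values.
for value, truncated, floored in compare_rounding([2.7, -0.3, -2.7]):
    print(f"{value:>5}: int() -> {truncated:>2}, floor() -> {floored:>2}")
```

For positive coordinates the two conversions agree; they diverge only once a coordinate is negative and non-integer, which is where a typecast would shrink a rectangle's extent instead of extending it.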

### Description of the bug Hello, I'm using the following function to convert PDF files into PNG:

```
pix = page.get_pixmap(matrix=fitz.Matrix(40, 40), alpha=False, colorspace=fitz.csGRAY)
pix.save(image_path, "png")
```

The conversion occurs as expected...

upstream bug
fix developed
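To give a sense of scale for the `fitz.Matrix(40, 40)` call in the report above, the arithmetic can be sketched without PyMuPDF. The page size (US Letter, 612 x 792 points) is an assumption, since the report does not state it, and the helper names are hypothetical. The sketch relies only on two facts: a zoom matrix multiplies the page's width and height in points, and a grayscale pixmap without alpha stores one byte per pixel. The excerpt is truncated, so this is not a diagnosis of the reported bug.

```python
from typing import Tuple


def pixmap_dimensions(width_pt: float, height_pt: float, zoom: float) -> Tuple[int, int]:
    """Approximate pixel dimensions of a page rendered with a uniform zoom."""
    return round(width_pt * zoom), round(height_pt * zoom)


def raw_size_bytes(width_px: int, height_px: int, components: int = 1) -> int:
    """Uncompressed sample size: one byte per component per pixel."""
    return width_px * height_px * components


# Assumption: a US Letter page (612 x 792 points); the report gives no size.
width_px, height_px = pixmap_dimensions(612, 792, 40)
size = raw_size_bytes(width_px, height_px, components=1)  # csGRAY, alpha=False
print(f"{width_px} x {height_px} px, about {size / 1e6:.0f} MB uncompressed")
```

Because the default rendering resolution is 72 dpi, a zoom of 40 corresponds to 2880 dpi, so the pixel count grows by a factor of 1600 relative to an unscaled render.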

Info 1: ![image](https://github.com/pymupdf/PyMuPDF/assets/58320500/97d87637-4d31-4960-b95e-595ef5bc70d2) Info 2: ![image](https://github.com/pymupdf/PyMuPDF/assets/58320500/c403bd22-9af5-4fac-8572-e55a7946dd53) Info 3: ![image](https://github.com/pymupdf/PyMuPDF/assets/58320500/c42f14fb-4b28-4fd9-9f60-f259f00c18a6) Using VS Code with type checking set even to "basic", I still receive an alert about not finding the `Page` base for use...

duplicate
enhancement
postpone

### Feature request Can OCR using Tesseract add a user-settable parameter for page segmentation mode (psm)? This would be very useful because when source documents are forms, OCR recognizes the...

enhancement

**Is your feature request related to a problem? Please describe.** Python type hints (which can be validated using mypy or Pylance in vscode) are very helpful; fitz_new has much better...

enhancement
postpone