PyMuPDF issues

AttributeError: `'Annot' object has no attribute 'del'`

### Description of the bug We are trying to detect any type of widget and delete it. When we perform the operation `page.delete_widget(w)` it throws AttributeError: `'Annot' object has no...

karenli6

bug

fix developed

get_pixmap method stuck on one page and runs forever

7

### Description of the bug I have a script that takes a PDF document URL, iterates through all the pages, generates a pixmap for each page, and uses it to...

SofiiaChaban

bug

example required

Memory leaks when merging PDFs

2

### Description of the bug Hello, First of all, thank you for all the work you've been putting into this project. Last November, I reported a minor memory leak issue...

cormier

upstream bug

fix developed

Remove text not working in 1.23.25 version vs 1.20.2

6

### Description of the bug Text removal from pdf with PyMuPDF works good in python3.8 and PyMuPDF 1.20.2: Python bindings for the MuPDF 1.20.3 library but if used python3.12 and...

bilykigor

upstream bug

fix developed

page.get_text() returns hexadecimal text for some characters

2

### Description of the bug `get_text()` extracts numbers in the Cash Flow table in this document as hexadecimal characters. Copy/paste from the page and `pdftotext` extract the correct text. ###...

brandenkmurray

upstream bug

fix developed

Modified util_make_irect() to capture area more robustly;

4

Relates to issue #3163. This updates updates the area of the rectangle based on the floating point. Instead of simply typecasting from float to int, this function now floors the...

psambit9791

PDF's 45º lines dissapearing in png conversion

3

### Description of the bug Hello, I'm using the following function to convert PDF files into PNG: ``` pix = page.get_pixmap(matrix=fitz.Matrix(40, 40),alpha=False,colorspace=fitz.csGRAY) pix.save(image_path, "png") ``` The conversion occurs as expected...

gabrielbeneli-missler

upstream bug

fix developed

Vscode Type Checking cannot access member "get_pixmap" for type "Page". Member "get_pixmap" is unknown - Pylance

1

Info 1: ![image](https://github.com/pymupdf/PyMuPDF/assets/58320500/97d87637-4d31-4960-b95e-595ef5bc70d2) Info 2: ![image](https://github.com/pymupdf/PyMuPDF/assets/58320500/c403bd22-9af5-4fac-8572-e55a7946dd53) Info 3: ![image](https://github.com/pymupdf/PyMuPDF/assets/58320500/c42f14fb-4b28-4fd9-9f60-f259f00c18a6) Using Vscode with Type Checking even set to "basic", still receive an alert about not finding the `Page` base for use...

fredericomattos

duplicate

enhancement

postpone

Support for Page Segmentation Mode for calling Tesseract OCR

5

### Feature request Can OCR using Tesseract add a user-settable parameters for page segmentation mode (psm)? This would be very useful because when source documents are forms, OCR recognizes the...

stevesimmons

enhancement

Improve the Python type annotations for fitz_new

5

**Is your feature request related to a problem? Please describe.** Python type hints (which can be validated using mypy or Pylance in vscode) are very helpful; fitz_new has much better...

indigoviolet

enhancement

postpone

PyMuPDF
PyMuPDF copied to clipboard

Metadata

AttributeError: `'Annot' object has no attribute 'del'`

get_pixmap method stuck on one page and runs forever

Memory leaks when merging PDFs

Remove text not working in 1.23.25 version vs 1.20.2

page.get_text() returns hexadecimal text for some characters

Modified util_make_irect() to capture area more robustly;

PDF's 45º lines dissapearing in png conversion

Vscode Type Checking cannot access member "get_pixmap" for type "Page". Member "get_pixmap" is unknown - Pylance

Support for Page Segmentation Mode for calling Tesseract OCR

Improve the Python type annotations for fitz_new

← Metadata

Owner

Metadata

PyMuPDF PyMuPDF copied to clipboard

Metadata

← Metadata

Owner

Metadata

PyMuPDF
PyMuPDF copied to clipboard