PyMuPDF
Remove text not working in 1.23.25 version vs 1.20.2
Description of the bug
Removing text from a PDF with PyMuPDF works correctly with Python 3.8 and PyMuPDF 1.20.2 (Python bindings for the MuPDF 1.20.3 library).
With Python 3.12 and PyMuPDF 1.23.25, not all of the text is removed.
How to reproduce the bug
```python
import fitz

pdf_doc = fitz.open('data/file.pdf')
page = pdf_doc.load_page(0)
for block in page.get_text("words"):
    rect = fitz.Rect(block[:4])
    page.add_redact_annot(rect)
page.apply_redactions()

pdf_doc.save('data/file_1.pdf')
pdf_doc.close()
```
PyMuPDF version
1.23.25
Operating system
macOS
Python version
3.12
Could you add the input file, data/file.pdf, to this issue page?
Unfortunately not, it is under NDA. Maybe I can share some metadata?
The bug first appears in PyMuPDF 1.23.9; in 1.23.8 everything works correctly.
Sorry, but we need data to reproduce the problem. You can use direct e-mail addresses if there are data protection concerns, or else send a file with the confidential data removed.
Is this a valid e-mail address? [email protected]
@bilykigor yes - use it with confidence
Fixed in 1.24.0.
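For anyone who needs to tell whether an installed release falls in the range discussed here, the following is a minimal sketch rather than anything provided by PyMuPDF. The helper names `parse_version` and `is_affected` are hypothetical. It assumes plain dotted numeric version strings, and it assumes that every release from 1.23.9 up to, but not including, 1.24.0 is affected; the thread itself only confirms 1.23.9 and 1.23.25 as bad, 1.23.8 and 1.20.2 as good, and 1.24.0 as fixed.

```python
def parse_version(text):
    """Turn a dotted version string such as '1.23.25' into a tuple of ints."""
    return tuple(int(part) for part in text.strip().split("."))


# Bounds taken from this thread: first release reported bad, and the fixed release.
FIRST_AFFECTED = parse_version("1.23.9")
FIXED_IN = parse_version("1.24.0")


def is_affected(version_text):
    """Return True if the version lies in the half-open range [1.23.9, 1.24.0).

    Assumption: all releases inside that range behave like 1.23.9 and 1.23.25.
    """
    return FIRST_AFFECTED <= parse_version(version_text) < FIXED_IN


if __name__ == "__main__":
    # Versions mentioned in this thread.
    for version in ("1.20.2", "1.23.8", "1.23.9", "1.23.25", "1.24.0"):
        status = "affected" if is_affected(version) else "not affected"
        print(version, status)
```

Versions are compared as integer tuples because a plain string comparison would order "1.23.9" after "1.23.25" and misclassify the release from the original report.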