High memory usage when reading a specific PDF
Describe the bug
When trying to convert PDF to text, I notice very high memory usage for one specific PDF. This might mean there's a memory leak in the library.
To Reproduce
Code to reproduce the issue:
val reader = PdfReader(contents)
val extractor = PdfTextExtractor(reader)
for (i in 1..reader.numberOfPages) {
val text = extractor.getTextFromPage(i)
onPageParsed(text)
}
When reaching page 4 of the attached PDF, I see this uses ~17GB of memory. With other PDFs, this is way lower.
Expected behavior
Not using this much memory
System
- OS: macOS, Google Cloud Run
- OpenPDF version: 2.0.3
Your real name
Arne Stockmans
Additional context
This might mean there's a memory leak in the library.
It might mean that the memory could be used more efficiently but not a memory leak. Or did you see increasing memory which wasn't freed afterwards?
This might mean there's a memory leak in the library.
It might mean that the memory could be use more efficiently but not a memory leak. Or did you see increasing memory which wasn't freed afterwards?
Indeed, you're right, I didn't word it correctly there. The memory is freed afterwards