Large HTML conversion consumes a lot of memory

Open Hikariqz opened this issue 1 year ago • 2 comments

Hi guys,

I'm having an issue where converting a large HTML document to PDF is consuming a significant amount of memory. The HTML represents around 1200 pages, but the file size is only around 3MB.

When generating the PDF, I saw my JVM's memory usage climb to around 1GB. I used a profiler to investigate and found that a large number of byte arrays were being created during PDF generation. Most of the allocations appeared to come from calls to com.openhtmltopdf.css.newmatch.Condition#matches, which go through java.lang.StringBuilder.toString or java.lang.StringBuilder.
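
For anyone who wants to reproduce this kind of measurement without a profiler, here is a minimal sketch that samples heap usage before and after a workload. The `HeapProbe` class and the string-building loop are hypothetical stand-ins, not openhtmltopdf code; the loop only imitates the "build a string, call `toString`" pattern mentioned above. On Java 9 and later a `String` is backed by a `byte[]`, which would be consistent with `StringBuilder.toString` showing up as byte-array allocations. The numbers are approximate, since `System.gc()` is only a hint to the JVM.

```java
import java.util.ArrayList;
import java.util.List;

public class HeapProbe {
    /** Approximate used heap in bytes: allocated heap minus the free part of it. */
    public static long usedHeapBytes() {
        Runtime rt = Runtime.getRuntime();
        return rt.totalMemory() - rt.freeMemory();
    }

    /** Converts a byte count to mebibytes for readable output. */
    public static double toMiB(long bytes) {
        return bytes / (1024.0 * 1024.0);
    }

    public static void main(String[] args) {
        System.gc(); // a hint only; the JVM may ignore it
        long before = usedHeapBytes();

        // Hypothetical workload standing in for the real HTML-to-PDF conversion:
        // each toString() call allocates a new String with its own backing array.
        List<String> retained = new ArrayList<>();
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < 200_000; i++) {
            sb.setLength(0);
            sb.append("class-").append(i);
            retained.add(sb.toString());
        }

        long after = usedHeapBytes();
        System.out.printf("retained %d strings, heap delta ~ %.1f MiB%n",
                retained.size(), toMiB(after - before));
    }
}
```

Replacing the loop with the actual conversion call gives a rough before/after figure, but a profiler's allocation view remains the better tool for attributing memory to specific call sites.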

I'm hoping someone can offer some help or insight on how to make this process use less memory. Converting this many HTML pages to PDF puts a lot of pressure on the heap, and I'd like to find a more efficient way to handle it.

Really appreciate it!

Hikariqz, Jan 23 '24 16:01

I'm going to take a look at this. I'm also working with some pretty large HTML files, so this affects me too. In the meantime, I'm going to tag you on a duplicate of this issue in a forked repository where new development will be happening (see #921).

See: https://github.com/openhtmltopdf/openhtmltopdf/issues/1#issue-2096902877

siegelzc, Jan 23 '24 20:01

Thank you @siegelzc

Hikariqz, Jan 24 '24 01:01