CERMINE
CERMINE copied to clipboard
Apparent memory leak when running multiple workers
I'm experiencing a memory leak when running cermine
with multiple workers, on a large set of PDFs.
java -cp cermine-impl-1.13-Ian2P-jar-with-dependencies.jar pl.edu.icm.cermine.ContentExtractor -path /data/path -outputs jats,text,zones -workers 4
OK now I'm not sure if it's actually a leak... closing for more experimentation...
Upon further investigation, there is definitely a memory leak.
Initial memory use is around 5G, and just creeping up overnight.
Hi @rueberger , could you specify the exact version you are using? 1.13 doesn't seem to have option "-workers".