Issue With Parsing Graphical PDF

Open omitpavel opened this issue 1 year ago • 1 comments

I'm facing an issue while trying to get the text of graphical content.

The PDF: https://www.acwilgar.co.uk/Boiler-Manual-PDF/Ideal/Logic-Combi-Installation-Manual.pdf

Pages: 12 to 20

I always get the error "Maximum execution time of 120 seconds exceeded".

omitpavel, Nov 13 '24 08:11

This doesn't look like a bug in pdfparser to me. I can extract the text from the linked file with this code successfully:

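<?php
// Load the Composer autoloader so the pdfparser classes are available.
include 'vendor/autoload.php';
$parser = new \Smalot\PdfParser\Parser();
// Parse the whole document once, then read the text page by page.
$pdf = $parser->parseFile('./Logic-Combi-Installation-Manual.pdf');
foreach ($pdf->getPages() as $page) {
  echo $page->getText();
}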

@omitpavel Try increasing your value for max_execution_time in your PHP environment until you have enough time allowed to parse the file. You can do this temporarily with the set_time_limit() function.
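
As a rough illustration of the suggestion above, here is a minimal sketch that raises the time limit for a single script run and then prints only pages 12 to 20. The 300-second value is an arbitrary assumption to tune for your environment, and selectPageRange() is a hypothetical helper written for this example, not part of pdfparser. Note that parseFile() still parses the whole document, so limiting the page range may not shorten that step.

```php
<?php
// Sketch only: the 300-second limit and selectPageRange() are illustrative
// assumptions, not part of pdfparser.

/**
 * Return the pages from $first to $last (1-based, inclusive).
 */
function selectPageRange(array $pages, int $first, int $last): array
{
    return array_slice($pages, $first - 1, $last - $first + 1);
}

$autoload = __DIR__ . '/vendor/autoload.php';
$file = __DIR__ . '/Logic-Combi-Installation-Manual.pdf';

// Only run the parsing step when the library and the PDF are actually present.
if (is_file($autoload) && is_file($file)) {
    require $autoload;

    // Raise the limit for this run only; the timeout counter restarts from zero here.
    set_time_limit(300);

    $parser = new \Smalot\PdfParser\Parser();
    $pdf = $parser->parseFile($file);

    // Print the text of pages 12 to 20 only.
    foreach (selectPageRange($pdf->getPages(), 12, 20) as $page) {
        echo $page->getText(), PHP_EOL;
    }
}
```

If the limit is still hit, raising max_execution_time in php.ini (or running the script from the CLI, where the default limit is 0, meaning no limit) is the more permanent alternative.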

rupertj, Nov 03 '25 09:11