Peter Williams

Results 13 comments of Peter Williams

We have some PDF full-text search code that is unfortunately in a private repo. The idea behind it is simple. We keep track of the `textMark`s that were used to...

Hi, PaperCut customers need PDF/A compliance. Is anyone else planning to work on this? If not then I can make a proposal.

Ben, I will investigate this. I have am working on a few versions of table extraction code that I have not submitted yet. They address most/all of the issues you...

Thanks. That will give me a benchmark to compare against.

Hi @cyberlord29 It looks like my table extraction code changes caused this problem. Those changes improve extraction of many other types of tables. We can give you a better experience...

peter.wi > @peterwilliams97 That sounds great , can you leave an email Id here so I can send it to you ? [email protected]

Hi Maneesh Sorry for the late reply. This is my email. ---------------------------------------------- Peter Williams 0488 783 700 / +61 488 783 700 On Thu, Nov 26, 2020 at 9:40 AM...

Coincidentally, I am away from PDF this month working on image processing. Which Go JPEG decoder(s) do you recommend?

This is a useful feature. I hope to find time to build a prototype on top of PageText. As noted above, this will require references from the TextMarks back to...

Part 2 requires adding links from the extracted text back to the content stream during text extraction. It's the same principle as the links from extracted text back to bounding...