nosia icon indicating copy to clipboard operation
nosia copied to clipboard

Improve PDF parsing

Open cbldev opened this issue 8 months ago • 0 comments

Actual behavior

I noticed that on some complex PDF, with tables, pdftotext produce better result than pdf-reader gem.

pdftotext: https://www.xpdfreader.com/pdftotext-man.html

Issue in Langchainrb: https://github.com/patterns-ai-core/langchainrb/issues/682

Expected behavior

Good results on complex PDF parsing.

cbldev avatar Jun 29 '24 12:06 cbldev