pdf-reader
Cropped text in 'Tj' PagesStrategy::OPERATORS
What I see in the PDF:
The text I get when I call page.text:

However, in page.raw_content I can see the full date text.

Can I be sure this is just the date being cropped for formatting? Or is it a systemic problem, so that where the PDF has '22.12.2019' I would get '22.12.20' instead of '22.12.19'?
This is likely to be the fault of the primitive algorithm in PageLayout. I'd love to find time to improve it!
The algorithm sometimes positions characters so that they overlap, in which case some of the characters are left out of the output.
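
To make the overlap effect concrete, here is a minimal toy sketch. It is not the PageLayout code. It assumes a simplified model in which each text run is placed on a row of fixed-width character cells and a later run overwrites any cells it lands on. `Run`, `layout_line`, and `col_width` are hypothetical names made up for this illustration.

```ruby
# Toy model only: this is NOT pdf-reader's PageLayout implementation.
# Run, layout_line and col_width are hypothetical names for illustration.
Run = Struct.new(:x, :text)

# Place text runs on one output line made of fixed-width character cells.
# col_width is the assumed width, in points, of a single output column.
def layout_line(runs, col_width:)
  line = +""
  runs.each do |run|
    # Convert the run's x position into a column index.
    col = (run.x.to_f / col_width).round
    # Pad with spaces if the run starts beyond the current end of the line.
    line << " " * (col - line.length) if col > line.length
    # Write the run at its column; cells already occupied are overwritten,
    # which is where characters of an earlier run get lost.
    line[col, run.text.length] = run.text
  end
  line
end

runs = [Run.new(0.0, "22.12.2019"), Run.new(40.0, "Total")]

# Columns wider than the real glyphs: the second run lands on column 8
# and overwrites the last two digits of the year.
puts layout_line(runs, col_width: 5.0)

# Narrower columns: the second run lands on column 10 and nothing is lost.
puts layout_line(runs, col_width: 4.0)
```

In this model the year is cut to "20" only because a neighbouring run is mapped onto the same cells, so the loss depends on the layout around the text and not on the date value itself.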