pdfminer icon indicating copy to clipboard operation
pdfminer copied to clipboard

Implemented OneByteIdentityH/V encoding cmap (with test pdf)

Open hynekcer opened this issue 7 years ago • 0 comments

Some type of CID fonts with "Identity" cmap is problematic and nothing is extracted. An example is http://stackoverflow.com/questions/43832499/pdfminer-empty-output

I fixed it. The simple test file is taken from question http://stackoverflow.com/questions/38119176/itextsharp-font-widths-definition-not-correctly-loaded

Maybe a better or more general solution would be to use the existing /ToUnicode map, but my solution is similar to /Identity-H that also ignores the unicode map and believe that it is an identity.

hynekcer avatar May 13 '17 22:05 hynekcer