HummusJSSamples
HummusJSSamples copied to clipboard
text-extraction thwarted by inline images
Played a bit with text-extraction sample and found that if an inline image is encountered (a BI / ID / EI construct) the rest of the page is skipped. Most likely this is happening because the image stream that follows ID is parsed as a PDF token not as a stream.
Any hint on how I might skip inline images?
Thanks!