PDFLayoutTextStripper issues

can't generate text file

Error String index out of range: -1 in PDFLayoutTextStripper

2

Hi, I have this code, with an attached PDF to test. public void doStrip() { String string = null; try { PDFParser pdfParser = new PDFParser(new RandomAccessFile(new File("D:/escaner/errorsPDFBOX/AN20-0149-0602201842.pdf"), "r")); pdfParser.parse(); PDDocument...

Jaumexr

Is it Android compatible?

Can I use this library in my Android app? I tried and got the following error: java.lang.NoClassDefFoundError: Failed resolution of: Ljava/awt/color/ColorSpace;

dwitiyabhatt

Uncaught Index out of Bounds

We had a case where the changed code was reached with index 0 (a character at the beginning of the line). With the proposed condition (index>0), the program executes normally. Only one file was...

joachim-roth
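
The guard described in this report can be sketched as below. This is a minimal standalone illustration, not the library's code: the class PreviousCharacterGuard and the method isPreviousCharacterSpace are hypothetical names, and the sketch assumes the failing code inspected the character just before the given index.

```java
// Hypothetical helper for illustration; not part of PDFLayoutTextStripper.
final class PreviousCharacterGuard {
    private static final char SPACE_CHARACTER = ' ';

    // Returns true when the character just before `index` is a space.
    // Assumption: index 0 has no previous character, so it is reported
    // as "not a space" instead of calling charAt(-1), which would throw
    // "String index out of range: -1".
    static boolean isPreviousCharacterSpace(String line, int index) {
        if (index <= 0 || index > line.length()) {
            return false;
        }
        return line.charAt(index - 1) == SPACE_CHARACTER;
    }
}
```

Whether "false" is the right answer for index 0 depends on how the caller uses the result, so treat the fallback value as an assumption to check against the real code.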

Sample Code Doesn't Work

1

The sample code in the README indicates that PDFParser has a constructor that takes a RandomAccessFile and a String. However, no constructor with this signature exists.

ghost

Are other languages, such as Chinese, supported?

Are other languages, such as Chinese, supported? What should I do in order to use this feature? With just Apache PDFBox, I can extract text from the PDF documents.

jjnnzb

Updated README.md

Fixes #40

ghost

C#

Any chance it could be implemented for C# PdfBox? https://www.codeproject.com/Articles/538617/Working-with-PDF-files-in-Csharp-using-PdfBox-and

fayepsr

isSpaceCharacterAtIndex

return this.line.charAt(index) != SPACE_CHARACTER; Here the index can be greater than the length of the char array. Options: try..catch, or test against line.length. I did a lazy fix with try...catch and the result is OK...

Schagalaah
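
As an alternative to the try...catch workaround, the second option mentioned above (testing against the line length) might look like this. It is a sketch only: LineBounds and isIndexInBounds here are hypothetical standalone names, the line is passed in as a String for simplicity, and returning false for an out-of-range index is an assumption about what the caller expects.

```java
// Hypothetical standalone version for illustration; not the library's class.
final class LineBounds {
    private static final char SPACE_CHARACTER = ' ';

    // True when `index` addresses an existing character of `line`.
    static boolean isIndexInBounds(String line, int index) {
        return index >= 0 && index < line.length();
    }

    // Keeps the quoted comparison (charAt(index) != SPACE_CHARACTER),
    // but returns false for an out-of-range index instead of throwing
    // StringIndexOutOfBoundsException. The fallback value is an assumption.
    static boolean isSpaceCharacterAtIndex(String line, int index) {
        return isIndexInBounds(line, index)
                && line.charAt(index) != SPACE_CHARACTER;
    }
}
```

An explicit bounds test keeps the normal path free of exceptions and makes the out-of-range behavior visible at the call site.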

getNumberOfNewLinesFromPreviousTextPosition

1

If height is 0 (which can happen in some documents), the variable "int numberOfLines" will be 2147483647 (Integer.MAX_VALUE). This results in too many empty lines being added. Quick and dirty fix, but...

Schagalaah
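
The arithmetic behind this report can be reproduced in isolation. In Java, dividing a floating-point value by 0.0 yields Infinity, and casting Infinity to int yields Integer.MAX_VALUE. The sketch below is not the library's method: NewLineCount, unguarded, and guarded are hypothetical names, the vertical gap is passed in directly, and falling back to a single new line when the height is not positive is an assumption.

```java
// Hypothetical illustration of the zero-height case; not the library's code.
final class NewLineCount {

    // Reproduces the problem: with height == 0 the division gives
    // Infinity, and the cast to int gives Integer.MAX_VALUE.
    static int unguarded(float verticalGap, double height) {
        return (int) (Math.floor(verticalGap) / height);
    }

    // Guarded variant. Assumption: when the height is not positive,
    // emit exactly one new line instead of dividing.
    static int guarded(float verticalGap, double height) {
        if (height <= 0) {
            return 1;
        }
        return Math.max(1, (int) (Math.floor(verticalGap) / height));
    }
}
```

For a gap of 50 and a height of 10 the guarded variant returns 5; for a height of 0 it returns 1 rather than 2147483647.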
