pdfalto icon indicating copy to clipboard operation
pdfalto copied to clipboard

PDF to XML ALTO file converter

Results 83 pdfalto issues
Sort by recently updated
recently updated
newest added

First, thanks for a great tool! I have problems with soft hyphens being omitted: Resulting in: ``` ``` Is this intentional? Or would it be possible to replace soft hyphens...

Originally from this issue https://github.com/kermitt2/grobid/issues/241 ``` (base) [Luca@falcon lin-64]$ ./pdfalto_server -fullFontName -noLineNumbers -noImage -annotation -filesLimit 2000 /tmp/TUW-217619.pdf /tmp/TUW-217619.alto.xml --timeout 50 Segmentation fault ``` ``` (base) Lucas-MacBook-Pro:mac-64 lfoppiano$ ./pdfalto_server -fullFontName -noLineNumbers...

Document DOI: dx.doi.org/10.1063/1.3068408 (probably there is a paywall) ![image](https://user-images.githubusercontent.com/15426/180397932-8fb5c2c1-73f9-4a52-83b3-1feffc3a21a0.png) pdfalto from grobid-0.7.2-SNAPSHOT extracts it as (interesting is with ID=`p1_w193`): ```xml ``` The characters are mapped incorrectly? I forgot what was...

Hi, again with compiler error. Now on newer REDHAT. `[hamersjbo@erstapfrk004 pdfalto]$ make Consolidate compiler generated dependencies of target xpdf [ 35%] Built target xpdf Consolidate compiler generated dependencies of target...

sample here: [heap-bufferoverflow-pos-)%at pdfalto.zip](https://github.com/kermitt2/pdfalto/files/8811939/heap-bufferoverflow-pos-.at.pdfalto.zip) Describe info: ``` $ ./pdfalto heap-bufferoverflow-pos-\>at\ pdfalto.cc\:190\:5 ==43072==ERROR: AddressSanitizer: heap-buffer-overflow on address 0x6040000000f1 at pc 0x00000043f92b bp 0x7fff65e8f0f0 sp 0x7fff65e8e8a0 WRITE of size 33 at 0x6040000000f1...

Thanks for your great tool. I have a question about the background image in this PDF (the lines of the table). The extracted svg is of this pdf is quite...

I need it for a down-stream XSLT pipeline; https://gitlab.coko.foundation/XSweet/XSweet/-/tree/pdf2html/applications/pdf2html

question

Is it also possible to convert an Alto XML file to PDF?

After performing the build steps [described in the README](https://github.com/kermitt2/pdfalto#build), I encountered the following error with the `make` command: ``` joe@choskimac-iii workspace % git clone https://github.com/kermitt2/pdfalto.git && cd pdfalto Cloning into...

bug
macos-specific