PDFIO.jl icon indicating copy to clipboard operation
PDFIO.jl copied to clipboard

PDF Reader Library for Native Julia.

Results 24 PDFIO.jl issues
Sort by recently updated
recently updated
newest added

`pdPageExtractText` API is one the core APIs of PDFIO. However, smaller large number of allocations make it a bit slower. This code needs to be refactored to ensure the text...

performance

@JuliaRegistrator register()

tracker

Natural tabular objects in a PDF document should ideally be picked up for extraction. The intent of the project is API development, hence it will be headless for most part....

enhancement

I tried to run pdPageExtractText on the pdf located: https://www.gov.im/media/1360682/isle-of-man-inflation-report-november-2021.pdf However, every character of the text was being interpreted as "\0" After much pain and effort trailing through the PDFIO...

What does this error mean? ```julia julia> pdPageExtractText(stdout, page) ERROR: AssertionError: Invalid interval Stacktrace: [1] Interval @ C:\Users\bdeon\.julia\packages\Rectangle\Imrhs\src\interval.jl:5 [inlined] [2] Interval @ C:\Users\bdeon\.julia\packages\Rectangle\Imrhs\src\interval.jl:20 [inlined] [3] on_cmap_command!(stm::IOBuffer, command::Symbol, params::Vector{CosInt}, cmap::PDFIO.PD.CMap) @...

Hi! I found the `Unexpected character` error while parsing many of my PDFs. Here is one example of a PDF giving me that error: https://drive.google.com/file/d/1YXdN7TfwK87_5ekbUElYRFOkVLifKj1F/view?usp=sharing ```julia julia> pdDocOpen("/home/diego/Downloads/Vernon et al....

It is common to use different fonts to denote semantic meaning (e.g italics for emphasis or larger font size for section titles). Is it possible to extract text that is...

enhancement

This implementation may be needed to be reviewed along with #2. Although, there may not be an exact overlap in some cases the implementation logic can be similar.

enhancement

Content filter for JPEG and JPEG2000 should be supported. Since, these are special type filters whether decoding over direct streaming into the graphics channel for rendering should be reviewed.

enhancement

SASLPrep can be implemented using the Unicode consortium supplied libraries: http://site.icu-project.org/ but I guess this may be unnecessarily added dependency. Enhancement request has been raised to include the feature in...