Sebastian

Results 166 comments of Sebastian

@Niedzwiedzw which approach are you using? I will try to give you instructions on how to get the relevant information without leaking the sensitive data tomorrow.

I don't think we are going to switch to nom. It is great, but PDF is a mess and we already have a handwritten parser.

The PDF Reference lists `ColorSpace`as a non-optional field of inline images. And I have no intention of allowing various derivations from the specification as that is a hole without bottom.

@Niedzwiedzw you are in luck. The `color_space`field is an `Option`, so I went ahead and made it optional in inline images.

I may have used a new feature: https://blog.rust-lang.org/2022/01/13/Rust-1.58.0.html#captured-identifiers-in-format-strings Please check if it works with Rust 1.58

The examples are mostly outdated.. unfortunately. Please check https://github.com/pdf-rs/pdf_render/blob/master/render/examples/trace.rs for text extraction.

You should be able to run the example from inside the `render` folder. The egl stuff is only needed for pdf2image.

I would HIGHLY recommend that you peek at https://github.com/pdf-rs/pdf_render/blob/master/render/src/cache.rs#L188 and the returned TraceResults. We use it in production, so it is mostly proven (although not in German). It should work...

`tracer.finish()` will give you a `Vec`. `TraceResults` was unused and I removed it now.