Patrice Lopez

Results 601 comments of Patrice Lopez

Just to confirm that all your examples work fine once the xpdf language support packages installed, branch https://github.com/kermitt2/pdfalto/tree/language_packages (not yet merge).

Hi @samanthadalal, the public Grobid service was overloaded. I restarted it to clean the queue, you can try now, but it might be saturated again because some people launched heavy...

@bmorton1, so far the ligature are left as such, so if we have a `\uFB00`, we leave it as such and we don't rewrite it as 2 characters `ff`. We...

@vsolovyov the mapping is in both GROBID master and released versions. However, if the character is "dropped", it means that the unicode is not resolved for this glyph. It relates...

Hi @naufraghi ! First I am very sorry to be so slow to react to your PR... it seems that I have some difficulties to work more than 2-3 days...

Update to xpdf-4.03 was actually a bit complicated: new usage of fontconfig (if detected), updated form classes, and the `codeToGID` parameter required non-trivial specific init (https://github.com/kermitt2/pdfalto/commit/96bec4396819b3716bd4c4e2789a5109ccdcde9f#diff-4a663af7fcefc0e10a654736ff3a8d6184763165c567b28be0321b38167c639b) Normally everything is good...

Hello @nicolasfranck , thanks for reporting the issue! It's not working, but it's not the setting of the option in `src/pdfalto.cc`, there's a piece missing in TextPage::endPage (originally in pdf2xml,...

Hi @joewiz ! If I remember well, on mac `fontconfig/fontconfig.h` is installed under `/usr/local/include` but for some reasons, the compiler you use is not looking at `/usr/local/include` - this is...

> The solution then is to add `-I/usr/local/include` as compiler option. so I think adding to `CMakeLists.txt` the line: ``` set(CMAKE_REQUIRED_INCLUDES "/usr/local/include" "/usr/include") ``` I also saw this possible solution:...

Thank you for the feedback ! I take the freedom to reopen the issue, I will try to add a MacOS-specific fix directly in the CMakeList file because this is...