pdf2htmlEX icon indicating copy to clipboard operation
pdf2htmlEX copied to clipboard

Convert PDF to HTML without losing text or format.

Results 133 pdf2htmlEX issues
Sort by recently updated
recently updated
newest added

### Here is how installed and ran pdf2htmlEX: `wget https://github.com/pdf2htmlEX/pdf2htmlEX/releases/download/v0.18.8.rc1/pdf2htmlEX-0.18.8.rc1-master-20200630-Ubuntu-bionic-x86_64.deb` `sudo apt install -y ./pdf2htmlEX-0.18.8.rc1-master-20200630-Ubuntu-bionic-x86_64.deb` `pdf2htmlEX --zoom 1.5 9702_s24_qp_13.pdf` ### Output: Preprocessing: 20/20 Working: 0/20 Oops! Something went horribly wrong.......

Fixed the issue #81 Requires these libreries to be installed: `libopenjp2-7-dev libopenjp2-7`

I just tried by downloading latest font from online and created pdf to test. pdf2htmlEx idenitfied that with 100% accuracy. How it is possible to identify? Is it calling any...

every time I used pdf2htmlEX command line tool to convert pdf to html, I received single image, including background of the page, all the images belonging to the page. So,...

Version: ``` /usr/local/bin/pdf2htmlEX --version pdf2htmlEX version 0.18.8.rc2 Copyright 2012-2015 Lu Wang and other contributors Libraries: poppler 24.01.0 libfontforge (date) 20230101 cairo 1.16.0 Default data-dir: /usr/local/share/pdf2htmlEX Poppler data-dir: /usr/local/share/pdf2htmlEX/poppler Supported image...

pdf2htmlEX\pdf2htmlEX\src\HTMLRenderer\outline.cc ![image](https://github.com/user-attachments/assets/5af25222-4373-4218-a1c2-ea2592f66064) ```diff - writeUnicodes(f_outline.fs, item->getTitle(), item->getTitleLength()); + writeUnicodes(f_outline.fs, item->getTitle().data(), item->getTitle().size()); ``` | Feature | Old `getTitle()` | New `getTitle()` | |----------------|-----------------|-----------------| | Return type | `Unicode*` | `std::vector` |...

Newer browsers support so called text fragments in URLs to select a text on a newly opened page. See: - https://wicg.github.io/scroll-to-text-fragment/ - https://developer.mozilla.org/en-US/docs/Web/URI/Fragment/Text_fragments This works also with html pages generated...

After using this section for building I was not able to find pdf2htmlEX in /usr/local/bin as shown below

Your [`README.md`](https://github.com/pdf2htmlEX/pdf2htmlEX/blob/master/README.md) contains five demos including "Git Manual CJK/HTML". However, when I try to access to the HTML content of "Git Manual...", it shows the 404 Not Found error like...