Martin Hasoň
Martin Hasoň
- https://regexper.com/ - https://jex.im/regulex/
DjVuXML describs the metadata, hyperlinks, and hidden text associated with the DjVu/PDF file. DjVuXML file is used for the text layer in https://github.com/internetarchive/bookreader
hOCR is an open standard of data representation for formatted text obtained from optical character recognition (OCR).
I want to use ReLaXed with remote chromium to reduce docker image size. It is posible with browser options `--remote-debugging-address` and `--remote-debugging-port` (https://peter.sh/experiments/chromium-command-line-switches/). It would be great add CLI options...
For roles it would be great to support pandoc syntax ([try](https://pandoc.org/try/?text=%5Binterpreted+text%5D%7Brole%3D%7D%0A%0A%5Binterpreted+text%5D%7Brole%3Dcustom%7D&from=markdown&to=rst&standalone=0)): ```md [interpreted text]{role=} [interpreted text]{role=custom} ```
I propose to use pandoc syntax for divs for directives ([try](https://pandoc.org/try/?text=..+block%3A%3A+Title%0A+++%3Aparam%3A+1%0A%0A+++Text+in+block%0A%0A+++..+nested%3A%3A%0A++++++%3Aparam%3A+2%0A%0A++++++Text&from=rst&to=markdown&standalone=0)): ```markdown ::: {.block param="1" argument="Title"} Text in block ::: {.nested param="2"} Text ::: ::: ```