
Lightweight HTML Triplifier

Open jmkeil opened this issue 8 months ago • 9 comments

The package io.github.sparql-anything.sparql-anything-html has a heavy storage footprint (>160MB) due to its dependency on com.microsoft.playwright.driver-bundle, which essentially ships the Node.js binaries five times (Windows, Linux, Linux ARM, Mac and Mac ARM). As far as I understand, this is needed to run a headless browser that executes the JavaScript in the HTML being triplified.

I guess this is not needed in many use cases.

Therefore, I would like to ask you to consider providing an additional lightweight HTML Triplifier that triplifies only the static HTML document, without executing any JavaScript. This would make the binaries of downstream projects significantly smaller and would probably also reduce execution time.

jmkeil, Jun 13 '24 09:06