notpaper
notpaper copied to clipboard
papertohtml
https://papertohtml.org/
Perfect tool for this project.
This is great! I'd love to know what their processing pipeline is, like what structured data they extract from PDF/LaTeX, and how they use it to generate HTML.
Oh I just found a paper of theirs