freeze-dry
freeze-dry copied to clipboard
Snapshots a web page to get it as a static, self-contained HTML document.
Freeze-dry messes up if a stylesheet or framed document is encoded in utf16, utf32, or possibly other encodings. We [use](https://github.com/WebMemex/freeze-dry/blob/138a687ceba0aa8b079e3b30d982d74b1d703095/src/crawl-subresources.js#L137) [`FileReader.readAsText`](https://developer.mozilla.org/en-US/docs/Web/API/FileReader/readAsText) to decode these resources, which by default assumes utf8...
As explained in src/Readme: > Although we try to clone each Document living inside a frame (recursively), it may be impossible to access these inner documents because of the browser's...
Currently, we inline *all* resolutions listed in an ``'s `srcset`, all `` and `` sources, all stylesheets, etcetera. This makes snapshots huge. The upside is that the snapshot will be...
*This was previously [issue #134 in webmemex-extension](https://github.com/WebMemex/webmemex-extension/issues/134) ("Images not in snapshots from Medium.com").* When the page was viewed with javascript disabled, we should keep `` tags, and perhaps convert them...
Needs investigation, fixing, and tests. Some problems already apparent: - an SVG's `` element's `.href` property appears to not be a string, while we assume it to be. - an...
We currently split by comma, but that is a too simplistic approach. Especially as we use data URLs ourselves, which always contain a comma, we shoot ourselves in the foot...
I just noticed these [lines](https://github.com/ianb/pagearchive/blob/01f832583380309ec167c77f9af61e6f0af8f6aa/extension/make-static-html.js#L495-L497) in pagearchive: if (el.tagName == 'CANVAS') { return ''; }
Too many links are nowadays obscured by link shorteners and tracker URLs. For example, on Twitter, a link would point to https://t.co/1PT68A6LEt when the author meant to refer to https://voice.mozilla.org/....
I would like to enable calling `freezeDry(element)`, `freezeDry(range)`, and get back a string that serialises the given [Element](https://developer.mozilla.org/en-US/docs/Web/API/Element)/[Range](https://developer.mozilla.org/en-US/docs/Web/API/Range) (possibly also [DocumentFragment](https://developer.mozilla.org/en-US/docs/Web/API/DocumentFragment), array of elements, ...). This would be useful to...
I'm a little worried to see the WebMemex projects haven't been worked on in quite some time. I know they're both considered stable, but when it comes to browser-related projects,...