auto-archiver icon indicating copy to clipboard operation
auto-archiver copied to clipboard

use https://github.com/webrecorder/warcio.js to avoid extracting media for html file

Open msramalho opened this issue 1 year ago • 2 comments

Since the wacz_enricher can sometimes have hundreds of media items it does not scale well, one alternative is to dynamically fetch those using JS in the generated hmtl file, this is an experimental issue to test if that is feasible with: https://github.com/webrecorder/warcio.js

Expected steps:

  • extract the warc from wacz file
  • use https://github.com/webrecorder/warcio.js to get media entries and display them in the browser

msramalho avatar Sep 20 '23 08:09 msramalho

This would be one of the most impactful changes as it would allow to store less individual files (a lot of which are logos/profilepics etc).

msramalho avatar Nov 13 '23 15:11 msramalho

https://github.com/webrecorder/warcio.js/issues/67 blocked by this issue

msramalho avatar Feb 14 '24 17:02 msramalho