auto-archiver
auto-archiver copied to clipboard
use https://github.com/webrecorder/warcio.js to avoid extracting media for html file
Since the wacz_enricher can sometimes have hundreds of media items it does not scale well, one alternative is to dynamically fetch those using JS in the generated hmtl file, this is an experimental issue to test if that is feasible with: https://github.com/webrecorder/warcio.js
Expected steps:
- extract the warc from wacz file
- use https://github.com/webrecorder/warcio.js to get media entries and display them in the browser
This would be one of the most impactful changes as it would allow to store less individual files (a lot of which are logos/profilepics etc).
https://github.com/webrecorder/warcio.js/issues/67 blocked by this issue