warcreate
Chrome extension to "Create WARC files from any webpage"
As suggested by Noah Levitt at the Internet Archive.
Not really an issue, but a limitation of FileSaver.js (https://github.com/eligrey/FileSaver.js). This could be problematic when appending to existing WARCs or when creating WARCs from multiple web pages at once.
Likely critical, but might not be available via Chrome's webRequest API.

**Heritrix 3.2.0**

``` sh
WARC/1.0
WARC-Type: request
WARC-Target-URI: http://matkelly.com/
WARC-Date: 2015-12-11T13:25:07Z
WARC-Concurrent-To:
WARC-Record-ID:
Content-Type: application/http; msgtype=request
Content-Length: 207

GET...
```
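As a rough illustration of what a `request` record like the Heritrix one above involves, here is a minimal sketch that assembles one from a raw HTTP request string. The function name `buildWarcRequestRecord` and its parameters are hypothetical and are not taken from WARCreate's code. It assumes the raw request text is already available, which, as noted, may not be the case with the webRequest API.

```javascript
// Hypothetical helper for illustration only; not WARCreate's actual code.
const CRLF = "\r\n";

function buildWarcRequestRecord({ targetUri, date, recordId, concurrentTo, httpRequest }) {
  // Content-Length is the size of the record block in bytes, not characters.
  const contentLength = new TextEncoder().encode(httpRequest).length;

  const headers = [
    "WARC/1.0",
    "WARC-Type: request",
    `WARC-Target-URI: ${targetUri}`,
    `WARC-Date: ${date}`,
    // Record identifiers are URIs wrapped in angle brackets.
    `WARC-Concurrent-To: <${concurrentTo}>`,
    `WARC-Record-ID: <${recordId}>`,
    "Content-Type: application/http; msgtype=request",
    `Content-Length: ${contentLength}`,
  ];

  // Header lines, a blank line, the block, then two CRLFs closing the record.
  return headers.join(CRLF) + CRLF + CRLF + httpRequest + CRLF + CRLF;
}

// Example usage with placeholder identifiers.
const record = buildWarcRequestRecord({
  targetUri: "http://example.com/",
  date: "2015-12-11T13:25:07Z",
  recordId: "urn:uuid:00000000-0000-0000-0000-000000000001",
  concurrentTo: "urn:uuid:00000000-0000-0000-0000-000000000002",
  httpRequest: "GET / HTTP/1.1\r\nHost: example.com\r\n\r\n",
});
console.log(record);
```

The `WARC-Concurrent-To` field is what ties the request record to the response record captured in the same exchange, so both identifiers would need to be generated before either record is written.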
Rather than capturing the page at fixed time intervals, this will instruct WARCreate to capture the page every time it dynamically reloads or refreshes. This will ensure that there are no...
The user will be able to specify an interval (e.g., every 10 minutes) after which WARCreate will reload and then archive the page. This will enable users to leave their...
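The interval idea described above could be sketched roughly as follows. Everything here is an assumption for illustration: `startIntervalCapture` and `reloadAndArchive` are hypothetical names, and the timer functions are passed in so that a different scheduler (for example, the `chrome.alarms` API in an extension) could be substituted.

```javascript
// Hypothetical sketch; WARCreate does not necessarily schedule captures this way.
function startIntervalCapture(intervalMinutes, reloadAndArchive, timers = { setInterval, clearInterval }) {
  if (!(intervalMinutes > 0)) {
    throw new RangeError("intervalMinutes must be a positive number");
  }

  // Destructure so the timer functions are not invoked as methods of `timers`.
  const { setInterval: schedule, clearInterval: cancel } = timers;

  // Convert the user-facing minutes into the milliseconds timers expect.
  const handle = schedule(reloadAndArchive, intervalMinutes * 60 * 1000);

  // The returned function lets the user stop the recurring capture.
  return () => cancel(handle);
}

// Example usage: capture every 10 minutes, then stop immediately for the demo.
const stopCapture = startIntervalCapture(10, () => {
  console.log("reload the page, then generate a WARC"); // placeholder action
});
stopCapture();
```

Returning a stop function keeps the caller in control of when the recurring capture ends, which matters if the user closes the tab or changes the interval.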
Users are left unsure whether the process, which might take a few seconds, is underway.
Like http://github.com/machawk1/mink, to give easier access to the information the user wants to know about the project, as well as GitHub wiki links to use cases, etc.