Alex Osborne
Alex Osborne
@ikreymer noticed that the original site (https://www.cbns.org.au/) replays in archiveweb.page. It looks like the reason for this is that the implementation in [jsrewriter.js](https://github.com/webrecorder/wabac.js/blob/main/src/rewrite/jsrewriter.js#L34) is slightly different. In wabac/archiveweb.page when eval...
Since browsers change POST to GET when following a 302 redirect some sort of workaround is needed. The obvious options for implementing POST replay seem to be: - use 307...
> As per RFC 5988 arbitrary attributes are not allowed in Link I'm having trouble finding this statement in RFC 5988. The grammar also has a link-extension term which seemingly...
Pywb's documentation [explains](https://pywb.readthedocs.io/en/latest/manual/configuring.html#collection-provenance) the reasoning for the collection attribute as: > When using the auto-all collection, it is possible to determine the original collection of each resource by looking at...
Ah, I see. I had missed the anchor attribute. So what you're suggesting is this? ``` ; rel="memento"; datetime="Mon, 23 Mar 2020 13:37:04 GMT", ; rel="collection"; anchor="https://pywbtest.ws-dl.cs.odu.edu/example/20200323133704mp_/https://example.com/", ```
@kitarp29 I don't have commit access to this repository and I don't work for the Internet Archive. Please don't spam at-mentions to random people. Thanks!
My understanding is the "trans" in maxTransHops is [transclusion](https://en.wikipedia.org/wiki/Transclusion) (which often people call 'embed'), so applies to stylesheets, images etc not normal links. > Is there any setting for the...
Supporting HTTP/2 would likely involve writing a new FetchHTTP module on top of Apache HttpClient 5 or another HTTP client library. The current mechanism Heritrix uses for recording responses will...
Feel free to send a pull request updating the README or documentation as you feel would be appropriate. I'm no academic and don't know what the normal practice is but...
Since neither FetchHTTP choosing to not download the request body nor the WarcWriter choosing not to write the record changes fetch status code of the CrawlURI it's still considered a...