pywb
Core Python Web Archiving Toolkit for replay and recording of web archives
An issue to track remaining documentation work * [ ] rules.yaml customization/fuzzy matching (move from wiki?) * [ ] template variable reference * [ ] per-collection metadata loading (`metadata.yaml`) *...
_(Please forgive me if this is the wrong venue for questions such as this. I'm not aware of any other avenue.)_ I'm super impressed with pywb and have enjoyed wrapping...
I wanted to attempt to see why #432 was happening. I am still new to it but I wonder if a small introduction to how this is structured and where...
This is to track the creation of an access control/exclusion system for pywb. The first stage of this effort is happening on the ukwa fork, for details see: ukwa/ukwa-pywb#7 The...
## Describe the bug I can't save the website x, probably because it requires a specific client certificate. If I'm not using pywb, I can load the website in chrome...
## Is your feature request related to a problem? Please describe. There are a few cases where the indexer cannot correctly create a CDX file from a WARC file. There...
## Describe the solution you'd like It should be possible to pass CDX files along with WARC files on `wb-manager add`. As things stand now, that command can take needlessly...
`pywb.utils.loaders.HttpLoader.load()` sets the value of `headers['Cookie']` which is used in the call `self.session.get(url, headers=headers, stream=True)`. When cookies are set this way, if the url redirects, the cookies are not sent...
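The redirect behaviour described in this report can be reproduced outside pywb. The following is a minimal sketch, assuming the `requests` library is installed; it is not pywb's code, and `EchoCookieHandler` and `cookie_seen_after_redirect` are hypothetical names introduced only for this illustration. A throwaway local server redirects `/start` to `/end` and echoes back whatever `Cookie` header reaches `/end`, so the two ways of supplying a cookie can be compared.

```python
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer

import requests


class EchoCookieHandler(BaseHTTPRequestHandler):
    """Hypothetical test handler: /start redirects to /end, /end echoes the Cookie header."""

    def do_GET(self):
        if self.path == "/start":
            self.send_response(302)
            self.send_header("Location", "/end")
            self.send_header("Content-Length", "0")
            self.end_headers()
        else:
            # Echo the Cookie header (empty body if none was sent).
            body = self.headers.get("Cookie", "").encode("utf-8")
            self.send_response(200)
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)

    def log_message(self, *args):
        # Silence per-request logging on stderr.
        pass


def cookie_seen_after_redirect(**request_kwargs):
    """Request /start on a local server and return the Cookie header seen at /end."""
    server = HTTPServer(("127.0.0.1", 0), EchoCookieHandler)
    thread = threading.Thread(target=server.serve_forever, daemon=True)
    thread.start()
    try:
        url = "http://127.0.0.1:%d/start" % server.server_port
        with requests.Session() as session:
            session.trust_env = False  # ignore proxy settings from the environment
            return session.get(url, **request_kwargs).text
    finally:
        server.shutdown()
        server.server_close()


if __name__ == "__main__":
    # Cookie supplied as a raw header.
    print(repr(cookie_seen_after_redirect(headers={"Cookie": "name=value"})))
    # Cookie supplied through the cookies= argument.
    print(repr(cookie_seen_after_redirect(cookies={"name": "value"})))
```

In this sketch the raw `Cookie` header does not reach `/end`, while the cookie passed via `cookies=` does. Whether switching to `cookies=` is the right change for `HttpLoader.load()` is a separate question that this example does not settle.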
## Describe the bug Google AMP delivered ads use iframes (duh) but there is a quirk in how they work using FF that ensures that there is no rewriting of...
## Describe the bug Page: https://vr.with.in/watch/look-but-with-love-a-story-of-music Request: POST //auth.with.in/v1/devices/register Fails because it is rewritten as http: rather than https:, which is required. Because the first request is sent to http:...