pywb
                                
                                
                                
                                    pywb copied to clipboard
                            
                            
                            
                        Core Python Web Archiving Toolkit for replay and recording of web archives
I would like to activate the selection of a collection in proxy mode via proxy auth. In this mode, a user would access a pywb instance via [proxy](https://pywb.readthedocs.io/en/latest/manual/configuring.html#https-proxy), and instead...
## Describe the bug When clicking the 'Next Page' button of an archived page in PyWB, a 502 Bad Gateway Response is received after triggering a POST request. ## Steps...
## Describe the bug When triggering a POST request by clicking the 'Load More' button of an archived page replaying in PyWB, we get a 502 Bad Gateway Response. ##...
## Description This PR fixes the issue #865. Specifically, it appears that the bug was introduced (perhaps by accident) by commenting out lines 288-289 from `responseloader.py`. It appears that uncommenting...
Preventing the Python 3.12's "Deprecated since version 3.12: Use datetime.now() with UTC instead."
## Description See https://github.com/webrecorder/pywb/issues/913 ## Types of changes - [ ] Replay fix (fixes a replay specific issue) - [x] Bug fix (non-breaking change which fixes an issue) - [...
That's the thing: we have file systems with transparent compression nowadays (and to think this started with Stacker on MS-DOS!), so it makes sense to use **uncompressed** WARC files on...
Modern file systems now have compression schemes that surpasses userland compression schemes in convenience and sometimes even on efficiency, rendering the current use of `gzip` inconvenient when such filesystems are...
I'm running `pywb` on Python 3.12 nowadays, and I had noticed the following message on the logs now and then: ``` Deprecated since version 3.12: Use datetime.now() with UTC instead....
## Describe the bug In client_side_replay mode, replaying harvested PDFs that were hosted on Google Drive will only display the first page. All pages will replay correctly in Browsertrix or...
## Issue The National Diet Library, Japan (NDL) https://warp.ndl.go.jp/, is currently in the process of transitioning from OpenWayback to pywb. During this transition, we have identified a significant issue related...