waybackpack

Download the entire Wayback Machine archive for a given URL.

Results 23 waybackpack issues

Given: ``` $ waybackpack --uniques-only --no-clobber --follow-redirects --ignore-errors -d StevenHarman 'http://feeds.stevenharman.net/stevenharman' --from-date=20090220053814 ``` I get: ``` INFO:waybackpack.session: HTTP status code: 504 INFO:waybackpack.session: Waiting 1 second before retrying. INFO:waybackpack.session: Waiting 1...

This PR enables wildcard URL fetches from the Wayback Machine.

Add a parameter to download the whole site with its assets, not just pages (right now it only captures the HTML page of the site).

I would like to be nice to the Wayback Machine and space out my requests. An option to insert a delay of x seconds between fetching each page would allow...
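A minimal sketch of the behaviour this request describes, assuming a caller-supplied `fetch` function. The names `fetch_politely`, `fetch`, and `delay_seconds` are hypothetical and are not part of waybackpack; the sketch only illustrates pausing between consecutive requests.

```python
import time
from typing import Callable, Iterable, Iterator, Tuple, TypeVar

T = TypeVar("T")


def fetch_politely(
    urls: Iterable[str],
    fetch: Callable[[str], T],
    delay_seconds: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Iterator[Tuple[str, T]]:
    """Yield (url, result) pairs, pausing between consecutive fetches.

    `fetch` is a hypothetical callable supplied by the caller; `sleep` is
    injectable so the pause can be observed or skipped in tests.
    """
    for index, url in enumerate(urls):
        # Pause before every request except the first one.
        if index > 0:
            sleep(delay_seconds)
        yield url, fetch(url)
```

Because the delay sits between requests rather than after each one, fetching a single page incurs no wait at all.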

archive.is?

feature-request

Any chance of getting a feature where we can download a site from a range of dates? For example, 2015 to today, to try and get every copy of a URL, but...
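One way to think about a date-range request is as a query against the Wayback Machine's CDX index, which accepts `from` and `to` timestamp parameters. The sketch below only builds such a query URL; the helper name `build_cdx_query` is hypothetical, and whether waybackpack issues its queries this way is an assumption, not something stated in the issue.

```python
from datetime import date
from typing import Optional
from urllib.parse import urlencode

# Public CDX search endpoint of the Wayback Machine.
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_query(url: str, start: date, end: Optional[date] = None) -> str:
    """Build a CDX query URL listing captures of `url` between two dates.

    `end` defaults to today, matching the "2015 to today" example.
    """
    end = end or date.today()
    if start > end:
        raise ValueError("start date must not be after end date")
    params = {
        "url": url,
        "from": start.strftime("%Y%m%d"),  # CDX timestamps are digit strings
        "to": end.strftime("%Y%m%d"),
        "output": "json",
    }
    return f"{CDX_ENDPOINT}?{urlencode(params)}"
```

Each row the index returns carries a capture timestamp, which a downloader could then fetch one at a time.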

feature-request

Hello, I've tried Waybackpack, and the Internet Archive sometimes refuses connections for apparently no reason, as the refused link can be opened in the browser. I hoped that `--ignore-errors` would...

Hey, I'm running a somewhat simple command: `wayback_machine_downloader absglobal.com --all-timestamps --from 20110101000000 --to 20221231235959 --concurrency 5 --only "/(\/$|\.(html|htm|aspx)$)/i" --all` The downloader somewhat works. I get quite a few errors like...

> DEPRECATION: waybackpack is being installed using the legacy 'setup.py install' method, because it does not have a 'pyproject.toml' and the 'wheel' package is not installed. pip 23.1 will enforce...

Hi, I installed Waybackpack in hopes I could download all the swf files located here: "http://flashvortex.com/images/". In the Wayback Machine, I can access them here: "https://web.archive.org/web/*/http://flashvortex.com/images/*". This is all great,...