waybackpack issues

AttributeError: 'NoneType' object has no attribute 'status_code'

1

Given: ``` $ waybackpack --uniques-only --no-clobber --follow-redirects --ignore-errors -d StevenHarman 'http://feeds.stevenharman.net/stevenharman' --from-date=20090220053814 ``` I get: ``` INFO:waybackpack.session: HTTP status code: 504 INFO:waybackpack.session: Waiting 1 second before retrying. INFO:waybackpack.session: Waiting 1...

jasonkarns

Wildcard searching

5

This PR enables wildcard url fetches from wayback machine.

fgregg

Download whole site

27

Add a param to download whole site with assets, not just pages (as of right now it only captures html page of the site)

aolko

Inserting a sleep between each fetch request

5

I would like to be nice to the Wayback Machine and space out my requests. An option to insert a delay of x seconds between fetching each page would allow...

Derek-Jones

Support other sources?

2

archive.is?

justingoldberg

feature-request

question: able to download a website historically while only saving the 1st successful page?

2

any change to get a feature where we can download a site from a range of dates? for example 2015-Today to try and get every copy of a URL, but...

devinschumacher

feature-request

`--ignore-errors` and `--max-retries` do nothing against `ConnectionRefusedError`

1

Hello, I've tried Waybackpack, and the Internet Archive sometimes refuses connexions for apparently no reason, as the refused link can be openned in the browser. I hoped that `--ignore-errors` would...

malespiaut

Blank files

3

Hey, I'm running a somewhat simple command: `wayback_machine_downloader absglobal.com --all-timestamps --from 20110101000000 --to 20221231235959 --concurrency 5 --only "/(\/$|\.(html|htm|aspx)$)/i" --all` The downloader somewhat works. I get quite a few errors like...

Jack-Lewis1

pip deprecation warning:

1

> DEPRECATION: waybackpack is being installed using the legacy 'setup.py install' method, because it does not have a 'pyproject.toml' and the 'wheel' package is not installed. pip 23.1 will enforce...

pokeball99

I need to download all SWF files from Archive.

1

Hi, I installed Waybackpack in hopes I could download all the swf files located here. "http://flashvortex.com/images/" In the Wayback Machine, I can access them here. "https://web.archive.org/web/*/http://flashvortex.com/images/*" This is all great,...

MissionCyberSpace

waybackpack
waybackpack copied to clipboard

Metadata

AttributeError: 'NoneType' object has no attribute 'status_code'

Wildcard searching

Download whole site

Inserting a sleep between each fetch request

Support other sources?

question: able to download a website historically while only saving the 1st successful page?

`--ignore-errors` and `--max-retries` do nothing against `ConnectionRefusedError`

Blank files

pip deprecation warning:

I need to download all SWF files from Archive.

← Metadata

Owner

Metadata

waybackpack waybackpack copied to clipboard

Metadata

← Metadata

Owner

Metadata

waybackpack
waybackpack copied to clipboard