wayback-machine-downloader icon indicating copy to clipboard operation
wayback-machine-downloader copied to clipboard

429 Too Many Requests

Open lostact opened this issue 4 years ago • 6 comments

Hi, Recently archive.org has put a rate limit so making too much requests per minute will result 429 error and you have to wait some time to make more requests. So if you use concurrency (specialy above 4) some pages will be missing.

The issue can be easly solved by adding a sleep(n) then retry in download_file function if the http_error code is equal to 429. I don't know the rate limit details so i cant tell how much seconds we have to wait.

lostact avatar Apr 14 '20 22:04 lostact

How did you fix this? I installed the package using gem install. How can I add that sleep?

hockeybro12 avatar May 19 '20 21:05 hockeybro12

@lostact could you post your solution. No Ruby experience here!

AstroDan avatar May 25 '20 23:05 AstroDan

Or make a PR

dhouck avatar Jun 24 '20 01:06 dhouck

For starters, don't use a value higher than three with the -c switch. That should fix your problem! :)

PuccamiteTech avatar Aug 12 '20 15:08 PuccamiteTech

Did anyone come up with a fix for this ? Or alternatively can anyone confirm keeping to -c 3 or below a reliable solution ?

baob avatar Dec 06 '20 14:12 baob

I never hit rate limit with -c 4.

dbsanfte avatar Dec 15 '20 21:12 dbsanfte