benoit74
benoit74
Looks like you stumbled upon a random failure of the mediawiki instance. [Previous run](https://farm.openzim.org/pipeline/bd28ec67-94f6-4c0d-8f45-f50c4307c57a/debug) failed due to https://github.com/openzim/mwoffliner/issues/2389 ; I did not implemented retries on 500 errors because I expected...
Opened https://github.com/openzim/mwoffliner/issues/2400
Upstream bug is solved, but new run did not succeeded: https://farm.openzim.org/pipeline/1eb461ba-ad63-490c-a97d-8fa2cd5b4dd2/debug I contacted vikidia team to ask for guidance (in French, so I'm not copy-pasting discussion here). It is disturbing...
Vikidia team responded that configuration is fine on their end, and that the traffic generated by the scraper is very acceptable from their point of view. They suspected the 524...
Vikidia team says the 524 is a timeout by Cloudflare due to upstream (Mediawiki) timeout on big pages. They suggest we try again after 5 minutes to not overwhelm the...
We've finally achieved to rebuild FR versions @Popolechien can you please have a look at https://dev.library.kiwix.org/#lang=&q=vikidia and confirm we can move these recipes back to production? (I will assume other...
Still failing, with other error. Waiting for upstream changes around handling of errors.
Good candidates are probably: - [minecraft.wiki](https://minecraft.wiki/) (https://browse.library.kiwix.org/viewer#minecraftwiki_en_all_maxi/) - [wiki.restarters.net](https://wiki.restarters.net/Main_Page) (https://browse.library.kiwix.org/viewer#restarters_en_all_maxi/) If we confirm darkreader is sufficiently good, then we need to check how we wanna use it. Looks like it...
See https://www.reddit.com/r/Kiwix/comments/1iicz96/can_i_archive_the_entirety_of_reddit/ Basically, there are dumps of reddit pushed to https://academictorrents.com/browse.php?search=reddit Might be interesting source of data for a scraper We mind need to create multiple ZIMs, tbd.
Recipe created at https://farm.openzim.org/recipes/www.baseball-reference.com_en and requested