zim-requests
zim-requests copied to clipboard
mdwiki is failing
Recipe URL
https://farm.openzim.org/recipes?name=mdwiki
Last log lines
"method": "get",
"url": "https://mdwiki.wmcloud.org/w/api.php?action=query&format=json&prop=redirects%7Crevisions%7Cpageimages&rdlimit=max&rdnamespace=0%7C3000%7C3002&redirects=true&titles=African_histoplasmosis%7CAfrican_tick_bite_fever%7CAfrican_trypanosomiasis%7CAge_and_female_fertility%7CAge_spot%7CAgenesis_of_the_corpus_callosum%7CAggregatibacter_actinomycetemcomitans%7CAggressive_NK-cell_leukemia%7CAggressive_digital_papillary_adenocarcinoma%7CAggressive_fibromatosis%7CAgomelatine%7CAgoraphobia%7CAgranulocytosis%7CAgustin_Walfredo_Castellanos%7CAicardi_syndrome%7CAicardi%E2%80%93Gouti%C3%A8res_syndrome%7CAinhum%7CAir_pollution%7CAir_pollution_forecasting%7CAir_pollution_measurement%7CAir_quality_index%7CAir_quality_law%7CAirborne_transmission%7CAirway_remodelling%7CAkathisia%7CAlagille_syndrome%7CAlastrim%7CAlbendazole%7CAlbinism%7CAlbinism_in_humans%7CAlbright's_hereditary_osteodystrophy%7CAlbuminuria%7CAlcaftadine%7CAlclometasone%7CAlcohol-related_brain_damage%7CAlcohol-related_traffic_crashes_in_the_United_States%7CAlcohol_Dependence_Data_Questionnaire%7CAlcohol_Use_Disorders_Identification_Test%7CAlcohol_abuse%7CAlcohol_and_breast_cancer%7CAlcohol_and_cancer%7CAlcohol_and_cardiovascular_disease%7CAlcohol_and_health%7CAlcohol_dependence%7CAlcohol_detoxification%7CAlcohol_education%7CAlcohol_flush_reaction%7CAlcohol_intoxication%7CAlcohol_law%7CAlcohol_packaging_warning_messages"
},
"code": "ERR_BAD_RESPONSE",
"status": 502
}
[error] [2024-07-20T16:44:43.052Z]
**********
Request failed with status code 502
How many times the recipe failed in a row?
Once
How many ZIM have been produced before failure?
Many
Which action did you undertake so far?
None, I do not have sufficient rights in Zimfarm but I recommend to disable the recipe for now
What's next?
This has to be reported to website admin (upstream website problem)
More details
This was not run on the farm, but I am seeing new 502 errors that I am pursuing.
@tim-moody Why we deal with https://mdwiki.wmcloud.org/ in place of mdwiki.org? Can you solve the bad upstream HTTP errors?
I made some fixes and reran:
T:12135; finish
[log] [2024-07-21T00:18:18.535Z] Summary of scrape actions: {
"files": {
"success": 101952,
"fail": 132
},
"articles": {
"success": 67992,
"fail": 734
},
"redirects": {
"written": 267373
}
}
[log] [2024-07-21T00:18:18.540Z] Finished dump
[log] [2024-07-21T00:18:18.540Z] Closing HTTP agents...
[log] [2024-07-21T00:18:18.540Z] All dumping(s) finished with success.
[log] [2024-07-21T00:18:18.540Z] Flushing Redis DBs
[info] [2024-07-21T00:18:18.540Z] Finished running mwoffliner after [22782s]
[log] [2024-07-21T00:18:18.541Z] Exiting with code [0]
[log] [2024-07-21T00:18:18.541Z] Deleting temporary directory [/dev/shm/mwoffliner-1721498317879]
Looks good to me.
Summary of changes:
- Use permalink medicine.tsv if current month not available to create mdwiki article list.
- Add custom user agent to all api calls to mdwiki.org to prevent blocking.
Would you be able to help with this @benoit74? See https://github.com/openzim/mwoffliner/issues/2069#issuecomment-2282540744. In sum, all that's needed is:
- Edit the mdwiki recipe so it uses mwoffliner:1.13.0 instead of mwoffliner:dev, and then
- Relaunch both mdwiki and mdwiki_app manually.
It's been a couple of months now since we've been able to release an updated WikiMed app (desktop), due to the maxi recipe using mwOffliner dev instead of 1.13.0 (seemingly).
I have the power to do it, but with great power comes great responsibilities ^^
Unfortunately for now @kelson42 requested to disable all mwoffliner recipes ... this needs to be discussed with him.
From my PoV, we should reenable all mwoffliner recipes with 1.13 since it looks like 1.14 is unfortunately not yet sufficiently ready and/or tests are not progressing fast enough.
I will bring the discussion to Slack to see if we can change current situation, thank you for insisting a bit.
@tim-moody I'm not sure you're on our Slack, or at least on the mwOffliner channel, but the outcome of the discussion there is that we can no longer use 1.13.0 due to a promise made to the Wikimedia people, and although I thought that a recent Wikivoyage ZIM was 1.13.0, it in fact was 1.14-dev, just its metadata were incorrect.
The bottom line is that if mwOffliner is failing to make an mdwiki with the dev branch, then we can't currently fix that, and we'll have to wait to produce any new WikiMed apps till we have a new ZIM. I don't know how important you feel it is to release apps based on the latest MDWiki info. It might be possible to use ZIMs produced by you if we can upload them to download.kiwix.org, because to build the app both ZIMs needs to be there (they are built via GitHub Actions).
I don't know if that's a good idea in practice... Maybe we should wait till the issues with dev have been ironed out, noting that it will need to use the new mobile HTML REST API, as the mobile-sections API is now deprecated.
@kelson42 thanks very much for your clear explanation. I was not aware of discussions with WMF, and it all makes sense now. I had also noticed that dev reports its version as 1.13.0, which adds confusion.
We are of course working to make mdwiki work with dev, but are not there yet, and my travels have also slowed things down.
I will try building the mdwiki-app recipe to see if that is an option. I would need credentials to upload to download.kiwix.org. I have spoke with James, and worst case he is OK with using a previous ZIM for the app.
Thanks for your ongoing help and support.
@tim-moody Although the Android app(s) use mdwiki-app, for the desktop app, I only use it for an old UWP version that cannot handle full-text search. For the Electron app, I use the normal maxi with ft index, so I don't think having just the -app version would help for my side, and I'm not sure how often the Android people release an updated mdwiki app.
In terms of your building with 1.13.0 (the real 1.13, not the dev masquerading as such), personally I don't think that would matter if the API is still working and hasn't been disabled, as one build is nothing compared to the massive hit on the mobile-sections API that running the full ZIM farm on it would represent. But that's your personal decision of course 😉.
Regarding credentials, I have these and could upload (so long as we have both versions: app_maxi and normal maxi), but I think we'd need agreement that this is OK as an emergency workaround before proceeding. Would library.kiwix.org pick up a ZIM uploaded to download.kiwix.org automatically? @benoit74?
I'm not sure of the value in releasing a new app with the last available (old) ZIM, if the old app with the same old ZIM is still working fine (which it is). We use a permalink to it in any case: https://kiwix.github.io/kiwix-js-pwa/wikimed/ . It could annoy people and waste their bandwidth!
@kelson42 @Jaifroid I will take everyone's advice and stop producing ZIMs until we can use dev. In the mean time we can use existing ZIMs and the app.
Regarding credentials, I have these and could upload (so long as we have both versions: app_maxi and normal maxi), but I think we'd need agreement that this is OK as an emergency workaround before proceeding. Would library.kiwix.org pick up a ZIM uploaded to download.kiwix.org automatically? @benoit74?
Yes, it is fully automated (currently refreshed once per hour at xx:30), also directly usable in our imager service, and all our readers using the OPDS catalog, e.g. Android, Apple, ... so handle with care. Especially since once published, someone might also start to download it, or use it to create a custom hotspot image with our imager service, ... lot's of frustration ahead if not done correctly.
Hi @tim-moody, where are we at with producing a new MD Wiki ZIM? I'm just conscious I haven't been able to release a new WikiMed app since June 16th, over three months ago...
Maybe we need to rethink, if dev is going to take too long, and do at least one interim release?
This post is not meant to pressure anyone. I'm just a bit confused about how to proceed. It's not just MD Wiki, by the way, all Wikimedia ZIM production has come to a complete stop at Kiwix (apart from two tiny Abkhazian and Bambara ZIMs).
@Jaifroid I have been away and not working on this. I need to catch up on where things stand. I was able to get part way through the run, but my 404 returns cause mwoffliner to crash.
MWoffliner should not crash and usually 404 are ignored (to the contrary to 5xx errors). Any of such behaviour should get its dedicated issue if confirmed.
At this stage, I don't see what Kiwix should/could do more and I will close the issue.
didn't mean to imply the problem was mwoffliner's