benoit74
This is the same problem: the scraper behaves very badly regarding images when run outside Wikimedia Cloud VMs (and probably only when images have not yet been uploaded to our...
Note that wikipedia_ru_all_maxi_2025-06 has more images now: https://browse.library.kiwix.org/viewer#wikipedia_ru_all_maxi_2025-06/%D0%A3%D0%BB%D0%B8%D1%86%D0%B0_%D0%A8%D0%B5%D0%B2%D1%87%D0%B5%D0%BD%D0%BA%D0%BE_(%D0%A1%D0%B0%D0%BD%D0%BA%D1%82-%D0%9F%D0%B5%D1%82%D0%B5%D1%80%D0%B1%D1%83%D1%80%D0%B3) I am a bit surprised that the size difference is so small. Plus I saw many image download errors in the log, but all I...
Marking this issue as "upstream" in the sense that nothing has to be fixed directly in this issue but in other issues.
Yes, we all got confused. Backoff strategy + failing the scraper when there are too many images missing.
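The two ideas named above (retrying with a backoff, and failing the run when too many images are missing) could look roughly like the sketch below. This is not mwoffliner's code: the names `backoffDelayMs`, `tooManyMissing` and `downloadWithBackoff`, the delays and the 5% threshold are all assumptions made for illustration.

```typescript
// Hypothetical helpers; none of these names or defaults come from mwoffliner.

/** Delay before retry number `attempt` (0-based): baseMs * 2^attempt, capped at maxMs. */
function backoffDelayMs(attempt: number, baseMs = 1000, maxMs = 60000): number {
  return Math.min(maxMs, baseMs * 2 ** attempt);
}

/** True when the share of missing images exceeds the tolerated ratio (assumed 5%). */
function tooManyMissing(missing: number, total: number, maxRatio = 0.05): boolean {
  if (total === 0) return false; // nothing was requested, so nothing is missing
  return missing / total > maxRatio;
}

/** Retry `download` with exponential backoff; resolve to null once retries are exhausted. */
async function downloadWithBackoff<T>(
  download: () => Promise<T>,
  maxRetries = 5,
  sleep: (ms: number) => Promise<void> = (ms) =>
    new Promise<void>((resolve) => setTimeout(resolve, ms)),
): Promise<T | null> {
  for (let attempt = 0; attempt <= maxRetries; attempt++) {
    try {
      return await download();
    } catch (err) {
      if (attempt === maxRetries) break; // give up, caller counts this image as missing
      await sleep(backoffDelayMs(attempt));
    }
  }
  return null;
}
```

A caller would count the `null` results and abort the scrape with an error when `tooManyMissing(missingCount, totalCount)` returns true, instead of silently producing a ZIM with few images.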
The problematic code is in this function (and its call tree): https://github.com/openzim/mwoffliner/blob/6568693185f79e3e0863f4d0f9770ffbb372edfd/src/util/saveArticles.ts#L21 Only fixing the backoff strategy does not seem very promising; the problem is different and more important than that. ##...
Some existing recipes are wrong (e.g. https://farm.openzim.org/recipes/canadian_prepper_bugoutroll_en has a bad name). You should prefer cloning a good recipe over one with errors. Even if ultimately we should not...
`format` field issue fixed, sorry for the inconvenience
For now, only admins can delete recipes; you do not have sufficient permissions. Editors can only archive recipes, since in general there is no good reason to delete permanently...
I sent an email to their contact address to ask whether they plan to make an API available in the near term, or have any other solution.
I did not get any answer to my email. On Sat, 11 June 2022 at 11:23, Kelson ***@***.***> wrote: > Any update here? >...