benoit74
benoit74
https://energypedia.info uses a specially crafted skin: `timis`, see https://energypedia.info/api.php?action=query&meta=siteinfo&formatversion=2&siprop=skins Needed for openzim/zim-requests#763
See https://farm.openzim.org/pipeline/9fa1afb0-37b9-4f37-a0b4-0fb3b8004190/debug and Problematic URLs are https://pokemon.fandom.com//rest.php/v1/page/Map%3AKanto/html and https://fallout.fandom.com//rest.php/v1/page/Map%3AFO76_Silo_Replacement_Mainframe_Cores/html When browsing the wiki, these pages look special, the URL is automatically transformed with an `?` trailing character, e.g. from `https://pokemon.fandom.com/wiki/Map:Kanto`...
Command used to repro: ``` mwoffliner --webp --mwUrl="https://www.appropedia.org" --format="novid" --verbose="log" --publisher="openZIM" --adminEmail="[email protected]" --customZimTitle="Test" --customZimLanguage="eng" --customZimDescription="Test" --forceRender RestApi --articleList "Osteomyelitis" ``` Online: https://www.appropedia.org/Osteomyelitis Version: 1.14.1-dev0 Error: ``` [error] [2025-02-05T20:44:29.522Z] Unable to...
To investigate https://github.com/openzim/mwoffliner/issues/2137, I've used [Clinic Doctor](https://clinicjs.org/doctor/) We should document this tool usage somewhere. Not sure where we should document such generic tools usage, but I feel like it would...
Currently, libzim API allows to add a redirect from one path to another path. In multiple scrapers (mwoffliner, freecodecamp, mindtouch, youtube ...), we need to add redirects so that we...
We are currently developing a new Vue.JS based UI for Gutenberg scraper. In this process, we are following new https://wiki.openzim.org/wiki/Architect_your_ZIM_file guidelines and we've implemented a No-JS fallback UI. While we...
ZIMing a website with millions of page is a problem. Some folks achieved to do that with significant manual interventions but stopping the crawler regularly and restarting it, so that...
There are situations where it would make sense to translate content before putting it inside a ZIM. https://www.reddit.com/r/Kiwix/comments/1p6geda/translating_a_website_beforewhile_zim/
Docker image for ARM64 is not building anymore. I've disabled this architecture image generation for now, but we need to fix this situation. See https://github.com/openzim/zimit/issues/410#issuecomment-2641948230
See https://github.com/openzim/zim-requests/issues/401 and https://github.com/webrecorder/browsertrix-crawler/issues/801 Looks like everything is fine on crawler side, but it does not work inside the ZIM ...