benoit74
In addition to page `--include` and `--exclude`, it is also possible to customize page resource block rules with `--blockRules`. This should be made available in zimit. See https://crawler.docs.browsertrix.com/user-guide/crawl-scope/#page-resource-block-rules We should add...
After #351, we have daily tests of YouTube player behavior inside the ZIM. We should automate all known working test cases of the test website at https://website.test.openzim.org/ to ensure quick...
We regularly get Puppeteer errors. One such error, however, seems quite easy to reproduce on http://darksouls.wikidot.com/ or http://darksouls2.wikidot.com/ or http://darksouls3.wikidot.com/
See https://github.com/webrecorder/browsertrix-crawler/issues/631
Recipe: https://farm.openzim.org/recipes/thecodelesscode.com_en_all Bug: the crawler does not find the `/topics`, `/contents` and `/names` links on the homepage (and maybe others). Tests done: - passing a mobile device in landscape (Pixel...
ZIM names must be unique to a single recipe, no matter the warehouse path / offliner in use, to avoid conflicting / overriding recipes and hard-to-debug situations. Even...
Currently, the artifacts expiration setting is saved in the task, together with the upload link. But in the worker, the expiration setting is never used by the task worker which...
For fields which have a limit on the number of characters, it would be super helpful to: - enforce the limit while editing the field (currently one has to save the...
Currently, when one wants the manager to stop starting new tasks, the container has to be stopped completely. This has many drawbacks: - worker seems dead in the UI...
See logs below. The polling interval is supposed to be 180 seconds, i.e. every 3 minutes, but this is not what happens: the interval is significantly longer. ``` [2024-10-04 09:24:54,922: INFO] starting zimfarm...