benoit74

Results 604 issues of benoit74

https://github.com/openzim/mwoffliner/issues/2318 exhibits an issue around Xapian. Digging deeper, I narrowed down the problem: - it is linked to item title - when item title is 122 times or less the...

bug
question

We have tests in python-scraperlib around the suggestions and full-text searches. One of them is now failing with python-libzim 3.7.0 (linked to libzim 9.3.0): https://github.com/openzim/python-scraperlib/blob/fef63f81fdb9dd6d2a5e17d9c8785e3fd22665e9/tests/zim/test_indexing.py#L114-L144 Test is failing on this...

question
documentation

For https://github.com/openzim/mwoffliner/issues/2180 I had to analyze the ZIM content. I did it with the `python-libzim` binding because I'm way more comfortable with it. The struggle I had (which luckily was not...

enhancement

https://github.com/openzim/libzim/issues/734 already highlights that users might want to search for documents in a specific language, but it focuses mostly on the technical issues of using proper stemmer / stop...

enhancement
question

https://github.com/openzim/libzim/pull/950 moved some settings from env parameters to CPP exposed settings. This has been released in libzim 9.3.0. https://github.com/openzim/libzim/pull/960, https://github.com/openzim/libzim/pull/974, https://github.com/openzim/libzim/pull/975 and https://github.com/openzim/libzim/pull/982 pushed this further by allowing to configure...

enhancement

https://github.com/openzim/libzim/pull/950 released in libzim 9.3.0 removed the support for configuration of cache behavior with `ZIM_CLUSTERCACHE`, `ZIM_DIRENTCACHE` and `ZIM_DIRENTLOOKUPCACHE` environment variables, since it is now possible to do so via CPP...

bug

Is it possible to customize the href target of the viewer? In scenarios where kiwix-serve is hosted / hidden in a subpath of the server, it would be handy to...

question
kiwix-serve

The list of categories seems to be static, e.g. it misses at least phet, video, iFixit, ... and blank category (more a library problem, but we need to be able...

enhancement
question

Crawler version: 1.5.1 (will update "soon"). We have a situation where the crawler gets interrupted due to the time limit but never exits. I sent a SIGTERM to the...

Command to repro:
```
docker run -v $PWD/output:/output --name crawlme --rm webrecorder/browsertrix-crawler:1.5.4 crawl --url "https://www.survivorlibrary.com/index.php/Accounting" --scopeType host --cwd /output --sizeLimit 100000000
```
Expected behavior: crawler should stop when archive size...