rgaudin
rgaudin
I thought the discussion was to use CT instead of is_front. I must have missed some comments
Versatile doesn't prevent us from having a better support for ZIM where we have control. Regarding urls in scrapers, you are well aware that we do ways to make it...
Being a dinosaur, I'd use the `X-` prefix. [MDN docs](https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers) says: > this convention was deprecated in June 2012 because of the inconveniences it caused when nonstandard fields became standard...
> Make metrics ask the "zim file" if the url is a front or not. When metric detects (by heuristics) that a url may be front article, it opens the...
@veloman-yunkan those are valid questions but those are scraper level ones… probably for each scraper. @kelson42 mentioned the WP1 data for mwoffliner. Should this popularity information only feed the indexer...
> Metadata is retrieved from the scrape This is not really possible unless we decide that we (ie. not the scraper) find, parse, fetch and process those on our own.
> I checked and found none, but I may have read the code too fast / missed something It's in warc2zim: - Language - Illustration - Source (set to the...
There is no policy for `.hidden/`. We've discussed (and I think there is a ticket with a proposal from @benoit74) regarding dev but there is nothing regarding `.hidden/custom_apps`. While I...
Do you mean we should archive each ZIM that is used in a custom app separately, for an undefined (yet) amount of time?
What would be the input? A list of bookshelves IDs to include?