rgaudin

Results: 845 comments by rgaudin

I thought the discussion was to use CT instead of `is_front`. I must have missed some comments.

Versatile doesn't prevent us from having better support for ZIM where we have control. Regarding URLs in scrapers, you are well aware that we do ways to make it...

Being a dinosaur, I'd use the `X-` prefix. [MDN docs](https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers) say:

> this convention was deprecated in June 2012 because of the inconveniences it caused when nonstandard fields became standard...

> Make metrics ask the "zim file" if the url is a front or not. When metric detects (by heuristics) that a url may be front article, it opens the...

@veloman-yunkan those are valid questions but those are scraper-level ones… probably for each scraper. @kelson42 mentioned the WP1 data for mwoffliner. Should this popularity information only feed the indexer...

> Metadata is retrieved from the scrape

This is not really possible unless we decide that we (i.e. not the scraper) find, parse, fetch and process those on our own.

> I checked and found none, but I may have read the code too fast / missed something

It's in warc2zim:

- Language
- Illustration
- Source (set to the...

There is no policy for `.hidden/`. We've discussed (and I think there is a ticket with a proposal from @benoit74) regarding dev but there is nothing regarding `.hidden/custom_apps`. While I...

Do you mean we should archive each ZIM that is used in a custom app separately, for an undefined (yet) amount of time?

What would be the input? A list of bookshelf IDs to include?