zim-requests icon indicating copy to clipboard operation
zim-requests copied to clipboard

professeurphifix.net

Open barbayellow opened this issue 4 years ago • 4 comments

  • Website URL: https://www.professeurphifix.net/
  • License: Copyright. However: Les contenus du site peuvent être utilisés dans le cadre d’un usage individuel à l’exclusion de toute exploitation commerciale. La reproduction, l’exploitation ou l’extraction à des fins commerciales, de tout ou partie des éléments contenus dans le site est strictement interdite à défaut du consentement écrit et préalable de Philippe Arnoux
  • Desired ZIM Title: Professeur Phifix
  • Desired ZIM Description: Ce site met à disposition des ressources pédagogiques pour les enseignants et les élèves. Trouvez des fiches pédagogiques à imprimer, des leçons en vidéo et des exercices interactifs en lignes.
  • Desired ZIM Icon –png (URL or attach one): https://www.professeurphifix.net/img/titre1bis_perrine.png
  • Language (ISO 639-3): fra
  • Is this a MediaWiki?: no

barbayellow avatar Dec 01 '21 15:12 barbayellow

For the record, we got an agreement from the author to include this content to the Kiwix library.

letompouce avatar Apr 13 '22 09:04 letompouce

Recipe created at https://farm.openzim.org/recipes/www.professeurphifix.net_fr_all ; for now limited to 100 pages to check behavior

benoit74 avatar Mar 24 '25 21:03 benoit74

I removed the limited and added a custom CSS, but now I realize we have a problem with most PDFs.

https://dev.library.kiwix.org/content/www.professeurphifix.net_fr_all_2025-03/www.professeurphifix.net/orthographe_impression/ortho_a_1.html vs https://www.professeurphifix.net/orthographe_impression/ortho_a_1.html

Image vs Image

I will have a look, don't know how I missed it in the 100 pages test ; maybe I was too focused on videos which worried me the most ^^

benoit74 avatar Mar 28 '25 09:03 benoit74

I had to tweak a bit the crawler to retrieve the PDFs (with --selectLinks "a[href]->href,embed[src]->src" for future me), but it is not sufficient, there is a kind of bug (even with the WARC on replayweb.page). I've opened https://github.com/webrecorder/browsertrix-crawler/issues/801 to seek guidance.

benoit74 avatar Mar 28 '25 09:03 benoit74

Upstream issue has been fixed at replay side, so this might be a zimit/warc2zim issue? To be investigated

benoit74 avatar Jun 12 '25 12:06 benoit74

This is anyway an upstream issue, so I've opened the issue in zimit for now.

benoit74 avatar Sep 25 '25 20:09 benoit74

Seeing as @barbayellow has left BSF and the website is flagged as © 2025 Professeur Phifix I would actually recommend closing the issue

Popolechien avatar Sep 26 '25 07:09 Popolechien

Well, it's not about @barbayellow :p But to be honnest: the project from which originated the request was terminated long ago (we actually are not in the country/field anymore).

Overall I agree about the copyright being a risk for the Kiwix Library context, I'd close the issue.

letompouce avatar Sep 26 '25 20:09 letompouce