Create DPG media.js
Supports DPG Media publications that have a very hard cookie wall.
Dutch publications:
- Algemeen Dagblad (ad.nl)
- BN DeStem (bndestem.nl)
- Brabants Dagblad (bd.nl)
- De Ondernemer (deondernemer.nl)
- De Stentor (destentor.nl)
- Eindhovens Dagblad (ed.nl)
- De Gelderlander (gelderlander.nl)
- Provinciale Zeeuwse Courant (pzc.nl)
- De Twentsche Courant Tubantia (tubantia.nl)
Belgian publications:
- 7sur7 (7sur7.be)
- Het Laatste Nieuws (hln.be)
- De Morgen (DeMorgen.be)
Why not base this on Embedded Metadata? EM output for these sites is already good; it gives us the correct title (avoiding mistakes like `Cnn` instead of `CNN`, as in one of the test cases), author, abstract, date, language, and even section. It's just missing the correct publication title ("AD.nl" instead of "Algemeen Dagblad") and it'll need to be directed to interpret the page as a `newspaperArticle`.
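To make the publication-title point concrete, here is a minimal sketch of a hostname lookup that could be applied on top of the Embedded Metadata output. It is not the code in this PR: `PUBLICATION_TITLES` and `publicationTitleFor` are hypothetical names, the table simply restates the list in the PR description, and it assumes a runtime with the standard `URL` global (browsers, Node.js).

```javascript
// Hypothetical lookup table: domains and names restate the list in the PR description.
const PUBLICATION_TITLES = {
  "ad.nl": "Algemeen Dagblad",
  "bndestem.nl": "BN DeStem",
  "bd.nl": "Brabants Dagblad",
  "deondernemer.nl": "De Ondernemer",
  "destentor.nl": "De Stentor",
  "ed.nl": "Eindhovens Dagblad",
  "gelderlander.nl": "De Gelderlander",
  "pzc.nl": "Provinciale Zeeuwse Courant",
  "tubantia.nl": "De Twentsche Courant Tubantia",
  "7sur7.be": "7sur7",
  "hln.be": "Het Laatste Nieuws",
  "demorgen.be": "De Morgen",
};

// Hypothetical helper: returns the full publication title for an article URL,
// or null when the host is not one of the listed publications.
function publicationTitleFor(url) {
  const host = new URL(url).hostname.toLowerCase().replace(/^www\./, "");
  return Object.prototype.hasOwnProperty.call(PUBLICATION_TITLES, host)
    ? PUBLICATION_TITLES[host]
    : null;
}
```

A translator built on EM could then overwrite the publication title with this value and set the item type to `newspaperArticle`; how that hooks into the EM translator is left out here.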
Yes, the metadata is perfect when retrieved through the browser extension. When these websites are queried remotely, for example through the Citoid API or Zotero Scaffold, you get stuck at myprivacy.dpgmedia.(be|nl). There is zero metadata to be found there. Zotero then makes a bigger mess than if it had done nothing, because it changes the input URL to that of the cookie wall. This has led to over 180 references in the English Wikipedia with the title "DPG Media Privacy Gate" and the redirect URL.
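The failure mode described here can be sketched as a simple guard. This is an illustration only, not how Zotero or Citoid currently behave: `isPrivacyGate` and `safeCitationUrl` are hypothetical helpers, and the only grounded detail is the host pattern myprivacy.dpgmedia.(be|nl). It assumes a runtime with the standard `URL` global.

```javascript
// Host pattern taken from the cookie-wall domain mentioned in this thread.
const PRIVACY_GATE_HOST = /^myprivacy\.dpgmedia\.(be|nl)$/i;

// Hypothetical helper: true when the URL points at the DPG Media cookie wall.
function isPrivacyGate(url) {
  try {
    return PRIVACY_GATE_HOST.test(new URL(url).hostname);
  } catch (e) {
    // Not a parseable absolute URL, so it cannot be the privacy gate.
    return false;
  }
}

// Hypothetical helper: keep the URL the user asked for when the request
// was redirected to the cookie wall, instead of citing the wall itself.
function safeCitationUrl(requestedUrl, finalUrl) {
  return isPrivacyGate(finalUrl) ? requestedUrl : finalUrl;
}
```

With a guard like this, a remote lookup that lands on the cookie wall would at least keep the original article URL rather than producing a "DPG Media Privacy Gate" reference.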
Another DPG Media publication, volkskrant.nl, had this problem as well until very recently. Edits made to Wikipedia earlier this month still carry the same "DPG Media Privacy Gate" title for that source. I don't see a Volkskrant Zotero translator though, so the change must have been made on their end. Hopefully that change will also be rolled out to their more regional publications, but until then this mess keeps growing by the day on Wikipedia.