nextcloud-metadata icon indicating copy to clipboard operation
nextcloud-metadata copied to clipboard

Read PDF custom properties

Open Devangarde opened this issue 3 years ago • 13 comments

Would you consider supporting PDF metadata? Not only defaults (creator, author, producer) but also custom properties.

Devangarde avatar Jul 23 '20 16:07 Devangarde

Yes, support for PDFs is in the plans, but still not finished.

As for the custom properties, it would be helpful if you could upload a sample, stating what would you expect to see.

gino0631 avatar Jul 24 '20 22:07 gino0631

With this I could maybe get rid of Zotero.. if it's searchable and then later on also get's a function to export by bibtex or so this would be huge..

Dubidubiduu avatar Oct 14 '20 15:10 Dubidubiduu

Hello, have you got some news regarding this ? (this feature should be great, as docx support and common MS others)

julian70400 avatar Dec 16 '21 15:12 julian70400

Hi, any update on this? PDF metadata would be amazing - it would make managing pdf libraries much much easier.

kormose avatar Aug 15 '22 11:08 kormose

@kormose thanks for the ping! I have committed the support for simple metadata from PDF Document Information Dictionary. It would be nice if people could test before the app is released.

gino0631 avatar Aug 28 '22 12:08 gino0631

Pdf more metadata will test this.

Piefje01 avatar Dec 19 '22 10:12 Piefje01

@Piefje01 not sure what do you mean?

gino0631 avatar Dec 19 '22 13:12 gino0631

I only see this

Gemaakt: 2022-12-19 08:58:28 +00:00 Aangepast: 2022-12-19 09:00:15 +00:00 Application: ocrmypdf 13.4.0+dfsg / Tesseract OCR-PDF 4.1.1 Number of pages: 3 PDF producer: pikepdf 5.0.1+dfsg PDF version: 1.7

But there is more metadata.

Titel Auteur Languages

But i dont see it.

Piefje01 avatar Dec 19 '22 13:12 Piefje01

It would be good to have a sample for analysis.

gino0631 avatar Dec 19 '22 13:12 gino0631

Oke i will send you one PDF Document2 - jef label 43.pdf Document2.pdf

Piefje01 avatar Dec 19 '22 13:12 Piefje01

OK, with "Document2.pdf" I see this: image

With "Document2 - jef label 43.pdf" it's a bit more: image

Do you get different results, or expect something else?

gino0631 avatar Dec 21 '22 21:12 gino0631

Hi,

Here a other example

I see this

Titel: Untitled Gemaakt: 2020-10-23 17:35:11 +02:00 Aangepast: 2023-01-10 09:50:08 +00:00 Application: ocrmypdf 13.4.0+dfsg / Tesseract OCR-PDF 4.1.1 Number of pages: 11 PDF producer: pikepdf 5.0.1+dfsg PDF version: 1.7

There must also be

ISSN 2045 DOI 10.1038

Legal note

Accesion Nuimber

Keywords

Abstract

Notes

And more fields

Met vriendelijke groet / With kind regards,

Jefta van Eijk

RisDATAcom B.V Phone: +31 (0) 85 111 0186 @.*** | www.risdatacom.nl

From: @.*** (gino0631)" @.> To: gino0631/nextcloud-metadata @.> Cc: Piefje01 @.>, Mention @.> Date: Wed, 21 Dec 2022 13:16:22 -0800 Subject: [*** CBV ***] Re: [gino0631/nextcloud-metadata] Read PDF custom properties (#65)

OK, with "Document2.pdf" I see this:

With "Document2 - jef label 43.pdf" it's a bit more:

Do you get different results, or expect something else? — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you were mentioned.

De informatie verzonden in dit e-mail bericht is uitsluitend bestemd voor de geadresseerde. Gebruik van deze informatie door anderen dan de geadresseerde is verboden. Indien u dit bericht ten onrechte ontvangt, wordt u verzocht de inhoud niet te gebruiken maar de afzender direct te informeren door het bericht te retourneren en het daarna te verwijderen. Openbaarmaking, vermenigvuldiging, verspreiding en/of verstrekking van de in de e-mail ontvangen informatie aan derden is niet toegestaan. RisDataCOM/Servicenet staat niet in voor de juiste overbrenging van een verzonden e-mail, noch voor de tijdige ontvangst daarvan. Externe e-mail wordt door RisDataCOM/Servicenet niet gebruikt voor het aangaan van verplichtingen.

Engels The information contained in this communication is confidential and may be legally privileged. It is intended solely for the use of the individual or the entity to whom it is addressed and the others authorised to receive it. If you are not the intended recipient, you are hereby notified that any disclosure, copying, distribution or taking any action in reliance of the contents of this information is strictly prohibited and may be unlawful. RisDataCOM/Servicenet is neither liable for the proper and complete transmission of the information contained in this communication nor for any delay in its receipt. Any e-mail messages from RisDataCOM/Servicenet are given in good faith but shall not be binding nor shall they construe any obligation

Piefje01 avatar Jan 10 '23 18:01 Piefje01

@Piefje01 have you attached the PDF? I can't see any link in the post above for some reason.

gino0631 avatar Jan 12 '23 16:01 gino0631