How to extract Authorname, Institution, Country from "authors" box

Open jasobro opened this issue 1 year ago • 2 comments

Hi,

At the moment it is possible to get author information via doc.authors. Is it also possible to break this information down further and retrieve each author's name, institution, and country? doc.authors returns all author information in a single string, and I don't know how to retrieve the individual entities.

jasobro avatar Dec 12 '23 18:12 jasobro

@jasobro you could check whether METEOR or AutoMETA works for your use case. Also, we have experimented with using a fine-tuned GPT-3.5 model for extracting bibliographic metadata.

That said, if Papermage could provide more detailed metadata, that would be quite helpful for many!
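
As a stopgap, a plain-Python heuristic can sometimes split the flat author string into fields. The sketch below is only an illustration, not anything Papermage provides: `AuthorRecord` and `split_author_block` are hypothetical names, and the input format (authors separated by `;`, fields separated by `,`, in the order name, institution, country) is an assumption that real author boxes often will not satisfy.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class AuthorRecord:
    """Hypothetical container for one author's details."""
    name: str
    institution: Optional[str] = None
    country: Optional[str] = None


def split_author_block(text: str) -> List[AuthorRecord]:
    """Heuristically split a flat author string into records.

    Assumption (not guaranteed by any library): authors are separated
    by ';' and each author's fields by ',' in the order
    name, institution, country.
    """
    records: List[AuthorRecord] = []
    for chunk in text.split(";"):
        # Drop empty fields and surrounding whitespace.
        fields = [f.strip() for f in chunk.split(",") if f.strip()]
        if not fields:
            continue
        name = fields[0]
        if len(fields) >= 3:
            # Everything between the name and the last field is
            # treated as the institution; the last field as the country.
            institution = ", ".join(fields[1:-1])
            country = fields[-1]
        elif len(fields) == 2:
            institution, country = fields[1], None
        else:
            institution, country = None, None
        records.append(AuthorRecord(name, institution, country))
    return records


# Made-up example input, following the assumed format.
sample = "Jane Doe, Example University, Finland; John Roe, Sample Institute, Germany"
for record in split_author_block(sample):
    print(record)
```

Anything that deviates from the assumed layout (affiliation footnote markers, e-mail addresses, names containing commas) will need a more robust approach, such as the dedicated metadata extractors mentioned above.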

juhoinkinen avatar Dec 18 '23 06:12 juhoinkinen

Your model link returns a 404 now...

xsank avatar Feb 27 '24 06:02 xsank