Extract text about papers from "related work" sections

Open koppor opened this issue 2 months ago • 10 comments

In science, authors write papers and relate their paper to other papers. The text in which they do so is very interesting, as it provides two things:

  • interesting other papers
  • description of other papers

Example: https://github.com/JabRef/jabref-demo-libraries/blob/main/chocolate/pdfs/LunaOstos_2024%20-%20Social%20Life%20Cycle%20Assessment%20in%20the%20Chocolate%20Industry%20-%20A%20Colombian%20Case%20Study%20with%20Luker%20Chocolate.pdf

Colombia is a middle-income country with a population
of approximately 50 million (CIA 2021), with at least 11
million people living in rural areas (DANE 2018). It is the
third most biodiverse country globally, following Brazil
and Indonesia (Nash 2022). 
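
For an excerpt like the one above, pairing each author-year citation with the text it is attached to could be sketched as follows. This is only a minimal regex-based illustration, not JabRef code: the class `CitationSnippets`, the record `Snippet`, and the pattern are assumptions made up for this example, and it only handles single citations of the form `(Name Year)`. An AI-based parser would be an alternative.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper for illustration; not part of JabRef. */
class CitationSnippets {

    /** One in-text citation together with the text that precedes it. */
    record Snippet(String author, String year, String text) {}

    // Assumption: citations look like "(CIA 2021)" or "(Nash 2022)".
    private static final Pattern CITATION =
            Pattern.compile("\\(([A-Z][\\p{L}&.\\- ]*?)\\s+(\\d{4}[a-z]?)\\)");

    static List<Snippet> extract(String paragraph) {
        // Text extracted from a PDF keeps hard line breaks; collapse them first.
        String text = paragraph.replaceAll("\\s+", " ").trim();
        List<Snippet> result = new ArrayList<>();
        int segmentStart = 0;
        Matcher matcher = CITATION.matcher(text);
        while (matcher.find()) {
            // The text between the previous citation and this one describes this citation.
            String described = text.substring(segmentStart, matcher.start()).trim();
            // Drop punctuation left over from the end of the previous segment.
            described = described.replaceFirst("^[,.;\\s]+", "");
            result.add(new Snippet(matcher.group(1), matcher.group(2), described));
            segmentStart = matcher.end();
        }
        return result;
    }
}
```

Applied to the excerpt, `CitationSnippets.extract(...)` returns three snippets, for `CIA 2021`, `DANE 2018`, and `Nash 2022`. The in-text label still has to be resolved against the PDF's reference list to find the actual entry.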

JabRef should do the following:

For each reference:

  • Look it up in the references
  • Add it to the library, or update the entry if it already exists
  • Find the descriptive text for the paper in the text
  • Add the descriptive text to comment-{username}, prefixed with [{citation-key}]: ([LunaOstos_2024] in our example). If comment-{username} already has content, append the new text, separated by an empty line.

Example result:

@Misc{Agency2021,
  author         = {{Central Intelligence Agency}},
  note           = {Accessed 4 Mar 2023},
  title          = {The world factbook: Colombia},
  year           = {2021},
  comment-koppor = {[LunaOstos_2024]: Colombia is a middle-income country with a population of approximately 50 million.},
  url            = {https://www.cia.gov/the-world-factbook/countries/colombia/},
}
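
The append rule that produces the comment-koppor field in this example (prefix with the citing paper's key, keep existing content, separate by an empty line) could be sketched like this. It is a minimal illustration under assumptions: a plain `Map` stands in for the entry's fields, and `CommentAppender` and `appendComment` are hypothetical names, not JabRef API.

```java
import java.util.Map;

/** Hypothetical helper; a plain Map stands in for a BibTeX entry's fields. */
class CommentAppender {

    static void appendComment(Map<String, String> fields, String username,
                              String citingKey, String description) {
        String fieldName = "comment-" + username;
        // Prefix the text with the key of the paper it was harvested from.
        String addition = "[" + citingKey + "]: " + description;
        // If the field is missing, store the addition as-is;
        // otherwise keep the existing content and append after an empty line.
        fields.merge(fieldName, addition, (existing, added) ->
                existing.isBlank() ? added : existing.stripTrailing() + "\n\n" + added);
    }
}
```

Calling `appendComment(fields, "koppor", "LunaOstos_2024", "Colombia is a middle-income country with a population of approximately 50 million.")` on an entry without a comment yields the field value shown in the example. A second call for another citing paper appends its text after an empty line.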

Related: Citation relations. However, they do not have the full text.

Screenshot from the linked PDF:

Image

Fuller context:

Image

Hint: It is perfectly OK to use langchain4j's AI interface for parsing etc.


This is NOT citation relations, because this issue is about harvesting knowledge from a PDF.

koppor avatar Oct 13 '25 19:10 koppor

Hi @koppor,

I’m interested in working on this issue and would like to start implementing a first version of the feature. My plan is to begin by:

  • extracting text from the “Related Work” section of a given PDF,
  • identifying referenced papers that already exist in the library (or adding them if missing), and
  • appending a short descriptive snippet to the corresponding comment-{username} field, formatted as [citation-key]: summary text, while preserving any existing content.

If that approach aligns with what you had in mind, could you please assign the issue to me or confirm that it’s fine for me to begin working on it?

Thanks!

jsochava avatar Oct 25 '25 03:10 jsochava

👋 Hey @jsochava, looks like you’re eager to work on this issue — great! 🎉 It also looks like you don't know how to assign issues to yourself. Please read our CONTRIBUTING.md to find out how. It will also guide you in other aspects of contributing to JabRef.

github-actions[bot] avatar Oct 25 '25 03:10 github-actions[bot]

/assign-me

jsochava avatar Oct 25 '25 03:10 jsochava

👋 Hey @jsochava, thank you for your interest in this issue! 🎉

We're excited to have you on board. Start by exploring our Contributing guidelines, and don't forget to check out our workspace setup guidelines to get started smoothly.

For questions on JabRef functionality and the code base, you can consult the JabRef Guru or ask on our Gitter chat.

In case you encounter failing tests during development, please check our developer FAQs!

Having any questions or issues? Feel free to ask here on GitHub. Need help setting up your local workspace? Join the conversation on JabRef's Gitter chat. And don't hesitate to open a (draft) pull request early on to show the direction it is heading towards. This way, you will receive valuable feedback.

Happy coding! 🚀

github-actions[bot] avatar Oct 25 '25 03:10 github-actions[bot]

/assign-me @github-actions

omkarrr2533 avatar Dec 07 '25 08:12 omkarrr2533

@omkarrr2533 edits don't work, you need to write the correct message.

koppor avatar Dec 07 '25 21:12 koppor

/assign-me

omkarrr2533 avatar Dec 08 '25 09:12 omkarrr2533

⚠️ Assignment Limit Reached

Hi @omkarrr2533, you've reached the maximum number of 1 concurrent assignments.

[!NOTE] Please complete or unassign from your current issues before taking on new ones.

Current options
  • Complete one of your assigned issues and close it
  • Use /unassign-me on an issue you're no longer working on
  • Ask a maintainer for an exception if this is a special case

jabref-machine avatar Dec 08 '25 09:12 jabref-machine

/assign-me

Muskan244 avatar Dec 08 '25 15:12 Muskan244

Thank you for your continued work on JabRef. At this point, you probably already know the basics and the important links. If not, our contributing guidelines are the place to go.

Happy coding! 🚀

jabref-machine avatar Dec 08 '25 15:12 jabref-machine