grobid icon indicating copy to clipboard operation
grobid copied to clipboard

endpoint to process fundings

Open Aazhar opened this issue 7 months ago • 2 comments

We have to process the document completely ( fulltext ) in order to extract fundings, which can be a bit cumbersome for instance when the uploaded user files are processed in real time.

An idea would be to have an endpoint that deals with the fundings in the same way we already have an endpoint that processes cited literature.

Aazhar avatar Nov 13 '23 09:11 Aazhar

Implemented with bb4b3591e51a084063791b6454f26faddfaf575b

kermitt2 avatar Nov 19 '23 17:11 kermitt2

I observe a runtime divided by 2 as compared to the complete full text processing. However, processing header alone is 50% faster. So this is the following typical scale for average document: fulltext 3s, header+funding:1.5s, header: 1s.

Endpoint is /api/processHeaderFundingDocument

kermitt2 avatar Nov 19 '23 17:11 kermitt2