Patrice Lopez
Patrice Lopez
It has some positive impact on the runtime too, although most of the computation take place outside Wapiti: Before: ``` PDF processing 100% │████████████████│ 1943/1943 (0:08:03 / 0:00:00) ``` After...
Hi @elonzh ! I think XML schema is the "right" output/exchange format for structured full text document (I would say everybody in publishing use it and all the scientific full...
Hi @lfoppiano ! This branch will require quite a few tests I think (I suspect it will raise problems to some of the grobid modules and I need to check...
Hi Achraf ! Indeed, without consolidation we have the author "Elena Bozzetta" associated to the ORCID of the previous author, while she does not have an ORCID on this paper....
Hi @elonzh ! I have planed to work on this in #963, identifying sub-parts in the a figure (by improving SVG/bitmap clustering) and sub-labels, but not sure about the result...
I went through the difference run tasks and adapted gradle for setting `java.library.path` in every cases. Tested on Linux-64, it works with OpenJDK 1.15 for the service, training commands, tests...
Hello @Siedlerchr Can you share the PDF to reproduce the problem?
Hi @pureblacker Thanks for testing Grobid ! Normally (when it works), the coordinates of the full figure (including figure title, captions, graphics) are given by the `@coord` attribute on the...
This is implemented with 2557847d086181fc900db1aa9182b1f1f19504cf - default format for definitions is the original MediaWiki format (as before) - optional supported format is "plain text" REST API parameter is `definitionFormat` with...
Also added `html` as format: ```console curl -X GET http://localhost:8090/service/kb/concept/Q190712?definitionFormat=html ``` ```json { "rawName" : "First Battle of the Marne", "preferredTerm" : "First Battle of the Marne", "confidence_score":0, "wikipediaExternalRef":171325, "wikidataId"...