Patrice Lopez

Results: 601 comments by Patrice Lopez

Hello! These digit characters correspond to font glyphs that are not correctly mapped to Unicode. So what needs to be done is to examine the PDF, identify the font...

Hi Luca, I think the current principle for training is indeed to label all possible headings and sub-headings just with ``, independently of the hierarchical level. Then, recreating a hierarchy...

Hello! With Grobid, most of the time the DOI appears under `` too. Grobid places the DOI under analytic when the bibliographical item is a piece (article, chapter, ...) included...

The Grobid ODD schema authorises `` under `` :) With Grobid, I think the DOI should in general appear under ``, like in the arXiv example for standalone bibliographical...

Apparently, from the examples, for cited references Grobid always puts `` under `` or `` as expected at the right level (I corrected my message above). I would say ``...

@lfoppiano yes, it would be a simple solution - the drawback is the multiplication of labels (model harder to train), but I think it is okay in this case,...

Hello! There is already a training web API in the Grobid service (typically run as a container with mounted paths), to start a training, get progress info, evaluation and fetch the...

Hi @lfoppiano! Thank you. Indeed, simply removing the `--add-opens jdk.incubator.foreign` makes it work with JDK 11, which is funny because I added all these modules, including this specific one,...

Normally we should cover this by adding a few examples of this cover page to the segmentation model. Do we have JSTOR CC-BY content?

Hello! Thanks for reporting this issue. This is indeed surprising. Normally it would be due to pdf2xml, because the version on Windows is currently a bit different from the...