grobid-quantities icon indicating copy to clipboard operation
grobid-quantities copied to clipboard

MeasureAnnotationSaxHandler incorrectly label the beginning of an annotated entity

Open lfoppiano opened this issue 3 months ago • 0 comments

Entities such as <valueRange> and <valueBase> are missing the starting I- identifier:

65 65 6 65 65 65 5 65 65 65 NOCAPS ALLDIGIT 0 NOPUNCT dd d 0 0 I-<valueBase>
. . . . . . . . . . ALLCAPS NODIGIT 1 DOT . . 0 0 <valueBase>
5 5 5 5 5 5 5 5 5 5 NOCAPS ALLDIGIT 1 NOPUNCT d d 0 0 <valueBase>
± ± ± ± ± ± ± ± ± ± ALLCAPS NODIGIT 1 NOPUNCT ± ± 0 0 <other>
0 0 0 0 0 0 0 0 0 0 NOCAPS ALLDIGIT 1 NOPUNCT d d 0 0 <valueRange>
. . . . . . . . . . ALLCAPS NODIGIT 1 DOT . . 0 0 <valueRange>
8 8 8 8 8 8 8 8 8 8 NOCAPS ALLDIGIT 1 NOPUNCT d d 0 0 <valueRange>

Issue occurs in <valueRange> and the <unitLeft> following <valueRange>

Version 0.8.0 works as expected. For the next version we would need to update the SaxHandler and the models.

lfoppiano avatar Mar 28 '24 07:03 lfoppiano