grobid-quantities
grobid-quantities copied to clipboard
GROBID extension for identifying and normalizing physical quantities.
This PR aims to restrict the value parser incorrect recognition: - if the TIME entities are less than 2 characters, they are aggregated with the previous entity if it exists
This is a follow up of #115. To change from paragraph based to sentence based we would need the following: - implement the segmentation in the quantityparser to reduce the...
This PR will update grobid-quantities to Dropwizard 2.x
This PR adds the definition of mass accumulation rates and sedimentation rates, from @thorge. However, this PR is still a draft because as discussed in #120 we need to first...
Sorry for keeping you busy @lfoppiano. I'm not sure where exactly to discuss this as it is more of a question than an issue. If this is the wrong place,...
As related to #92 I'm opening a new issue since this looks more a bug than an enhancement... The lexicon loader is ignoring the `supportsPrefixes`, and is loading tons of...
The quantity CRF model recognizes numerical expressions with exponents on 10 (in particular distorted one due to PDF text extraction):  However we are not currently parsing it (in their...
Some non SI units are not yet supported for normalisation.
The rest pool service gets blocked after this error appears (it's related to the clearNLP part for extracting the substance) : `Mar 07, 2018 10:30:25 AM org.grobid.core.engines.DefaultSubstanceParser parseSubstance SEVERE: error...
The word to number conversion works well, however there are few cases where the conversion is not working properly. The sentences is `VAL person were missing. One question was raised.`,...