grobid-quantities
grobid-quantities copied to clipboard
GROBID extension for identifying and normalizing physical quantities.
What do you think about the gigabit, like in `2.0 Gb RAM`? I annotated it as unknown unit type: ```xml 2.0 Gb RAM ``` Is it ok?
In the following sentence with a [zenithal hourly rate](https://en.wikipedia.org/wiki/Zenithal_hourly_rate) quantity: > the meteor shower at Mars is an Earth-equivalent zenith hourly rate 600 h −1 Should we annotate: ```xml (1)...
To know what form a @unit attribute's value must take (for example `unit="min"` or `unit="minute"`?), we can use this page http://cdsarc.u-strasbg.fr/cgi-bin/Unit?%3f Is that ok? Are there other sources we could...
As we moved from "lexical mapping" (not to say rules!) to a CRF parser to process and normalize the unit expressions, the full name unit are not covered by the...
For example "_twenty_ kilos" - currently the recognition is very bad and there is no normalization into numerical values. We should: - add a matching feature for this very limited...
Dates are _time measurement_, so in the scope of the tool. Currently the recognition of dates and time expressions is limited and there is no normalization for these expressions. -...
Hi @khundman and @chrismattmann, I'm currently working with @kermitt2 to the GROBID quantity model. I was wondering if you had time to have a look at the demo, if you...
This PR will select some paper to have an holdout set. At the moment, as the data set is small, we will use all the documents for create the final...
As discussed in previous issues, we need a proper method for disambiguating the units in context
We need to establish an holdout dataset that allow a more precise way to evaluate.