grobid-quantities
Annotation of currencies
Even in scientific papers, we may stumble upon expressions of amounts of money (150$, 3€).
Do we annotate $, €, etc.?
Honestly, no idea... We could perhaps annotate them with type CURRENCY and apply an automatic transformation based on the exchange rate, but I'm not sure we have done that before.
@kermitt2 what do you think?
Yes, it definitely makes sense to annotate them with the currency as the unit, using a type CURRENCY. So far I haven't seen such expressions of money in the first batch of training data.
Here is an example from halshs-01279855.training.tei.xml:
the French professional football league broadcasting rights over <measure type="value">
<num>one</num></measure> season (<measure type="value"><measure type="CURRENCY"
unit="€">€</measure> <num>668 million</num></measure>)
unit="€"
I would write something like unit="euro" instead.
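To make the suggestion concrete, here is a minimal sketch of how a symbol could be mapped to a spelled-out unit name when producing the annotation. Everything in it is an assumption for illustration: the `CURRENCY_NAMES` table, the function names, and the chosen spellings are hypothetical and are not taken from the grobid-quantities lexicon or code.

```python
from xml.sax.saxutils import escape, quoteattr

# Hypothetical symbol-to-name table (illustrative only, not the project's lexicon).
CURRENCY_NAMES = {"€": "euro", "$": "dollar", "£": "pound"}


def normalize_currency_unit(symbol: str) -> str:
    """Return a spelled-out unit name for a symbol, or the symbol itself if unknown."""
    symbol = symbol.strip()
    return CURRENCY_NAMES.get(symbol, symbol)


def currency_measure(symbol: str) -> str:
    """Build a <measure> element whose unit attribute holds the normalized name."""
    unit = normalize_currency_unit(symbol)
    # quoteattr adds the surrounding quotes and escapes the attribute value.
    return f"<measure type=\"CURRENCY\" unit={quoteattr(unit)}>{escape(symbol)}</measure>"


print(currency_measure("€"))
```

Note that a bare $ is ambiguous between several dollar currencies, so a table like this one would probably need context to pick the right unit.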