grobid-quantities

Annotation of currencies

Open everzeni opened this issue 7 years ago • 4 comments

Even in scientific papers, we may stumble upon expressions of amounts of money (150$, 3€).

Do we annotate $, €, etc.?

everzeni avatar Feb 27 '18 13:02 everzeni

Honestly, I have no clue. We could perhaps annotate them with type CURRENCY and apply an automatic conversion based on the exchange rate, but I'm not sure we have done this before.
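
To make the conversion idea concrete, here is a minimal sketch. Everything in it is an assumption for illustration: the `PLACEHOLDER_RATES_TO_EUR` table, the `to_base` function, and the rate values are hypothetical placeholders, not real exchange rates and not anything that exists in grobid-quantities.

```python
from decimal import Decimal

# Hypothetical rate table to a base currency (EUR). The values are made-up
# placeholders; a real system would need dated rates from an external source.
PLACEHOLDER_RATES_TO_EUR = {
    "EUR": Decimal("1"),
    "USD": Decimal("0.5"),  # placeholder, NOT a real exchange rate
}


def to_base(amount, currency, rates=PLACEHOLDER_RATES_TO_EUR):
    """Convert an amount in `currency` to the base currency of `rates`."""
    if currency not in rates:
        raise ValueError(f"no rate available for {currency!r}")
    # Go through str() so float inputs do not leak binary rounding noise.
    return Decimal(str(amount)) * rates[currency]
```

One open point this sketch ignores: the rate probably ought to depend on the publication date of the paper, which is why a fixed table like this one would not be enough in practice.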

@kermitt2 what do you think?

lfoppiano avatar Feb 28 '18 16:02 lfoppiano

Yes, it definitely makes sense to annotate them with the currency as the unit and a type CURRENCY. So far I haven't seen such expressions of money in the first batch of training data.

kermitt2 avatar Feb 28 '18 18:02 kermitt2

Here is an example from halshs-01279855.training.tei.xml:

the French professional football league broadcasting rights over <measure type="value">
<num>one</num></measure> season (<measure type="value"><measure type="CURRENCY" 
unit="€">€</measure> <num>668 million</num></measure>)

everzeni avatar Mar 05 '18 14:03 everzeni

Instead of unit="€", I would write something like unit="euro".
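
As a rough sketch of what that normalization could look like, the snippet below maps a currency symbol to a unit name and emits the inner `<measure>` element. The `CURRENCY_NAMES` table and the `currency_measure` helper are hypothetical, and the chosen names ("euro", "dollar", "pound") are only illustrative, not an agreed annotation scheme.

```python
from xml.sax.saxutils import escape

# Hypothetical symbol-to-name table; the names are illustrative only.
# Note that "$" is ambiguous (several currencies use it), so a real
# mapping would need context to pick the right one.
CURRENCY_NAMES = {"€": "euro", "$": "dollar", "£": "pound"}


def currency_measure(symbol):
    """Return a <measure> element with a normalized unit name for `symbol`."""
    unit = CURRENCY_NAMES.get(symbol)
    if unit is None:
        raise ValueError(f"unknown currency symbol: {symbol!r}")
    # The surface form stays as element text; only the attribute is normalized.
    return f'<measure type="CURRENCY" unit="{unit}">{escape(symbol)}</measure>'
```

For the example above, `currency_measure("€")` would give `<measure type="CURRENCY" unit="euro">€</measure>`, keeping the symbol in the text while the attribute carries the name.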

lfoppiano avatar Mar 05 '18 15:03 lfoppiano