
[ZH Currency] "三十几万元"、"二百余万元"、"一千多万元" recognized incorrectly

Open SoaringTiger opened this issue 4 years ago • 0 comments

Describe the bug

"三十几万元"、"二百余万元"、"一千多万元" are misinterpreted: the approximation markers (几, 余, 多) are dropped, so each phrase resolves to the exact round amount. "三十几元"、"二百余元"、"一千多元" are not recognized at all.

To Reproduce

三十几万元 ≠ 三十万元

{
  "Text": "三十几万元",
  "Start": 0,
  "End": 4,
  "TypeName": "currency",
  "Resolution": {
    "isoCurrency": "CNY",
    "unit": "Chinese yuan",
    "value": "300000"
  }
}

二百余万元 ≠ 二百万元

{
  "Text": "二百余万元",
  "Start": 0,
  "End": 4,
  "TypeName": "currency",
  "Resolution": {
    "isoCurrency": "CNY",
    "unit": "Chinese yuan",
    "value": "2000000"
  }
}

一千多万元 ≠ 一千万元

{
  "Text": "一千多万元",
  "Start": 0,
  "End": 4,
  "TypeName": "currency",
  "Resolution": {
    "isoCurrency": "CNY",
    "unit": "Chinese yuan",
    "value": "10000000"
  }
}

For comparison, the exact amount 一千万元 resolves to the same value:

{
  "Text": "一千万元",
  "Start": 0,
  "End": 3,
  "TypeName": "currency",
  "Resolution": {
    "isoCurrency": "CNY",
    "unit": "Chinese yuan",
    "value": "10000000"
  }
}

Expected behavior

Could a new type, "currencyrange", be added? For example:

{
  "Text": "一千多万元",
  "Start": 0,
  "End": 4,
  "TypeName": "currencyrange",
  "Resolution": {
    "isoCurrency": "CNY",
    "unit": "Chinese yuan",
    "value": "(10000000,)"
  }
}
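
To make the proposal concrete, here is a minimal, self-contained Python sketch of how an approximate amount could be mapped to an open-ended range in the shape shown above. It is not the Recognizers-Text implementation and does not call the library; the function names `parse_small_number` and `approx_currency_range`, the regular expression, and the choice to emit only a lower bound (as in "(10000000,)") are all assumptions made for illustration. It handles only simple numerals followed by 几, 余, or 多, an optional 万 or 亿, and 元.

```python
import re

# Hypothetical helper tables for this sketch; not taken from Recognizers-Text.
DIGITS = {"零": 0, "一": 1, "二": 2, "两": 2, "三": 3, "四": 4,
          "五": 5, "六": 6, "七": 7, "八": 8, "九": 9}
SMALL_UNITS = {"十": 10, "百": 100, "千": 1000}
SCALES = {"万": 10_000, "亿": 100_000_000}

# <numeral><approximation marker><optional scale>元
PATTERN = re.compile(
    r"(?P<num>[零一二两三四五六七八九十百千]+)"
    r"(?P<approx>[几余多])"
    r"(?P<scale>[万亿])?元"
)


def parse_small_number(text: str) -> int:
    """Parse a simple Chinese numeral below 10000, e.g. 三十 -> 30."""
    total, digit = 0, 0
    for ch in text:
        if ch in DIGITS:
            digit = DIGITS[ch]
        else:
            # A bare unit such as 十 counts as 1 * unit.
            total += (digit or 1) * SMALL_UNITS[ch]
            digit = 0
    return total + digit


def approx_currency_range(text: str):
    """Return a 'currencyrange' result, or None if there is no approximation marker."""
    match = PATTERN.fullmatch(text)
    if match is None:
        return None
    lower = parse_small_number(match["num"]) * SCALES.get(match["scale"], 1)
    return {
        "Text": text,
        "Start": 0,
        "End": len(text) - 1,  # inclusive end index, as in the outputs above
        "TypeName": "currencyrange",
        "Resolution": {
            "isoCurrency": "CNY",
            "unit": "Chinese yuan",
            "value": f"({lower},)",  # lower bound only; upper bound left open
        },
    }


if __name__ == "__main__":
    for sample in ["三十几万元", "二百余万元", "一千多万元", "三十几元", "一千万元"]:
        print(sample, approx_currency_range(sample))
```

Leaving the upper bound open follows the example in this issue. A real implementation might instead bound "三十几" below forty, but that is a separate design decision.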

Platform (please complete the following information):

  • Platform: [.NET ...]
  • Version of package: [v1.7.0]

SoaringTiger, Jun 30 '21 15:06