Recognizers-Text icon indicating copy to clipboard operation
Recognizers-Text copied to clipboard

[ZH NumberRange]"十来"、"二十来"、"十来万"、"二百三十来万"、"肆拾來億"、"500來億" etc. can't be recognized as numberrange

Open SoaringTiger opened this issue 4 years ago • 2 comments

Describe the bug "十来"、"二十来"、"十来万"、"二百三十来万"、"肆拾來億"、"500來億" etc . in Chinese mean "more than X"

Expected input/output

{
    "Text": "十来",
    "Start": 0,
    "End": 1,
    "TypeName": "numberrange",
    "Resolution": {
      "value": "(10,)"
    }
  }

{
    "Text": "二十来",
    "Start": 0,
    "End": 2,
    "TypeName": "numberrange",
    "Resolution": {
      "value": "(20,)"
    }
  }

{
    "Text": "十来万",
    "Start": 0,
    "End": 2,
    "TypeName": "numberrange",
    "Resolution": {
      "value": "(100000,)"
    }
  }

{
    "Text": "二百三十来万",
    "Start": 0,
    "End": 5,
    "TypeName": "numberrange",
    "Resolution": {
      "value": "(2300000,)"
    }
  }

{
    "Text": "肆拾來億",
    "Start": 0,
    "End": 3,
    "TypeName": "numberrange",
    "Resolution": {
      "value": "(4000000000,)"
    }
  }

{
    "Text": "500來億",
    "Start": 0,
    "End": 4,
    "TypeName": "numberrange",
    "Resolution": {
      "value": "(50000000000, )"
    }
  }

Platform (please complete the following information):

  • Platform: [.NET ...]
  • Version of package [v1.7.0]

SoaringTiger avatar Jun 25 '21 04:06 SoaringTiger

Wouldn't a more accurate interpretation of it in Chinese mean "a little more than X"? In our view, "十来" -> "(10,15)".

tellarin avatar Jun 30 '21 06:06 tellarin

Wouldn't a more accurate interpretation of it in Chinese mean "a little more than X"? In our view, "十来" -> "(10,15)".

😊 In my view

一百多 ->  (100, 200)
十几亿 ->  (1000000000, 2000000000)

But we can determine the lower limit of the numberrange.

SoaringTiger avatar Jun 30 '21 06:06 SoaringTiger