Recognizers-Text
Recognizers-Text copied to clipboard
[ZH NumberRange]"十来"、"二十来"、"十来万"、"二百三十来万"、"肆拾來億"、"500來億" etc. can't be recognized as numberrange
Describe the bug "十来"、"二十来"、"十来万"、"二百三十来万"、"肆拾來億"、"500來億" etc . in Chinese mean "more than X"
Expected input/output
{
"Text": "十来",
"Start": 0,
"End": 1,
"TypeName": "numberrange",
"Resolution": {
"value": "(10,)"
}
}
{
"Text": "二十来",
"Start": 0,
"End": 2,
"TypeName": "numberrange",
"Resolution": {
"value": "(20,)"
}
}
{
"Text": "十来万",
"Start": 0,
"End": 2,
"TypeName": "numberrange",
"Resolution": {
"value": "(100000,)"
}
}
{
"Text": "二百三十来万",
"Start": 0,
"End": 5,
"TypeName": "numberrange",
"Resolution": {
"value": "(2300000,)"
}
}
{
"Text": "肆拾來億",
"Start": 0,
"End": 3,
"TypeName": "numberrange",
"Resolution": {
"value": "(4000000000,)"
}
}
{
"Text": "500來億",
"Start": 0,
"End": 4,
"TypeName": "numberrange",
"Resolution": {
"value": "(50000000000, )"
}
}
Platform (please complete the following information):
- Platform: [.NET ...]
- Version of package [v1.7.0]
Wouldn't a more accurate interpretation of it in Chinese mean "a little more than X"? In our view, "十来" -> "(10,15)".
Wouldn't a more accurate interpretation of it in Chinese mean "a little more than X"? In our view, "十来" -> "(10,15)".
😊 In my view
一百多 -> (100, 200)
十几亿 -> (1000000000, 2000000000)
But we can determine the lower limit of the numberrange.