Recognizers-Text
Recognizers-Text copied to clipboard
[ZH Currency] "三十几万元"、"二百余万元"、"一千多万元" recognized incorrectly
Describe the bug "三十几万元"、"二百余万元"、"一千多万元" will be misinterpreted while "三十几元"、"二百余元"、"一千多元" can't be recognized.
To Reproduce
三十几万元 ≠ 三十万元
{
"Text": "三十几万元",
"Start": 0,
"End": 4,
"TypeName": "currency",
"Resolution": {
"isoCurrency": "CNY",
"unit": "Chinese yuan",
"value": "300000"
}
}
二百余万元 ≠ 二百万元
{
"Text": "二百余万元",
"Start": 0,
"End": 4,
"TypeName": "currency",
"Resolution": {
"isoCurrency": "CNY",
"unit": "Chinese yuan",
"value": "2000000"
}
}
一千多万元 ≠ 一千万元
{
"Text": "一千多万元",
"Start": 0,
"End": 4,
"TypeName": "currency",
"Resolution": {
"isoCurrency": "CNY",
"unit": "Chinese yuan",
"value": "10000000"
}
}
{
"Text": "一千万元",
"Start": 0,
"End": 3,
"TypeName": "currency",
"Resolution": {
"isoCurrency": "CNY",
"unit": "Chinese yuan",
"value": "10000000"
}
}
Expected behavior May we add new Type "currencyrange" ?
{
"Text": "一千多万元",
"Start": 0,
"End": 4,
"TypeName": "currencyrange",
"Resolution": {
"isoCurrency": "CNY",
"unit": "Chinese yuan",
"value": "(10000000,)"
}
}
Platform (please complete the following information):
- Platform: [.NET ...]
- Version of package [v1.7.0]