JioNLP
JioNLP copied to clipboard
[BUG]jionlp.ner.extract_time()解析英文逗号的bug
jionlp.ner.extract_time("每天早上八点,他去上学", time_base=time.time())解析正确
jionlp.ner.extract_time("每天早上八点,他去上学", time_base=time.time())解析错误
[{'time_candidate': '每天早上八点,', 'offset': [0, 7]}]
2023-08-28 12:19:43 ERROR parse_time_period: Traceback (most recent call last):
File "/home/stardust/anaconda3/envs/rasa/lib/python3.10/site-packages/jionlp/gadget/time_parser.py", line 1209, in parse_time_period
first_full_time_handler, second_full_time_handler, _, blur_time = self.parse_time_span_point(
File "/home/stardust/anaconda3/envs/rasa/lib/python3.10/site-packages/jionlp/gadget/time_parser.py", line 1048, in parse_time_span_point
blur_time = self.parse_time_point(
File "/home/stardust/anaconda3/envs/rasa/lib/python3.10/site-packages/jionlp/gadget/time_parser.py", line 1521, in parse_time_point
raise ValueError('## exception string {}
.'.format(time_string))
ValueError: ## exception string 早上八点,
.
看上去可能是正则表达式没考虑英文逗号?我没找到应该修改的地方,还请指点。