can't parse Korean datestring in specific cases

Open • aquatype opened this issue 2 years ago • 1 comment

Parsing seems to break whenever the date string contains the Korean year/month/day markers (년/월/일).

print(dateparser.__version__)
>>> 1.1.7

print(dateparser.parse('2023년 2월 10일 오전 3시 30분', locales=['ko']))
>>> None
print(dateparser.parse('2000년 1월 1일 19:30', locales=['ko']))
>>> None
print(dateparser.parse('1970년 1월 1일', locales=['ko']))
>>> None
print(dateparser.parse('12월 31일', locales=['ko']))
>>> None

Other cases, including relative keywords, work fine.

print(dateparser.parse('2023-02-10 03:30', locales=['ko']))
>>> 2023-02-10 03:30:00
print(dateparser.parse('오늘', locales=['ko']))  # today
>>> 2023-02-07 13:47:01.837601
print(dateparser.parse('내일', locales=['ko']))  # tomorrow
>>> 2023-02-08 13:47:01.838223
print(dateparser.parse('3시간 전', locales=['ko']))  # 3 hours ago
>>> 2023-02-07 10:47:01.838705

I also tried DateDataParser directly, but it fails the same way.

print(dateparser.date.DateDataParser(locales=['ko']).get_date_data('2023년 2월 10일 오전 3시 30분'))
>>> DateData(date_obj=None, period='day', locale=None)
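
For anyone hitting the same problem, here is a minimal stopgap sketch that handles only the four failing formats listed above, using just the standard library. The names `parse_korean_date` and `default_year` are hypothetical and not part of dateparser. The pattern is an assumption about what these strings look like, not a general Korean date grammar.

```python
import re
from datetime import datetime

# Hypothetical helper, NOT part of dateparser.
# Matches: [YYYY년] M월 D일 [오전|오후] [H시 [M분] | H:MM]
_KO_DATE = re.compile(
    r"(?:(?P<year>\d{4})년\s*)?"
    r"(?P<month>\d{1,2})월\s*(?P<day>\d{1,2})일"
    r"(?:\s*(?P<ampm>오전|오후)?\s*(?P<hour>\d{1,2})(?:시|:)"
    r"\s*(?:(?P<minute>\d{1,2})분?)?)?"
)


def parse_korean_date(text, default_year=None):
    """Return a datetime for the supported formats, or None otherwise."""
    m = _KO_DATE.fullmatch(text.strip())
    if m is None:
        return None
    # Assumption: a missing year means the current year unless overridden.
    year = int(m["year"]) if m["year"] else (default_year or datetime.now().year)
    hour = int(m["hour"] or 0)
    minute = int(m["minute"] or 0)
    # 오후 = PM, 오전 = AM; convert 12-hour values to 24-hour values.
    if m["ampm"] == "오후" and hour < 12:
        hour += 12
    elif m["ampm"] == "오전" and hour == 12:
        hour = 0
    try:
        return datetime(year, int(m["month"]), int(m["day"]), hour, minute)
    except ValueError:
        # Out-of-range values such as month 13 or hour 25.
        return None


print(parse_korean_date('2023년 2월 10일 오전 3시 30분'))  # 2023-02-10 03:30:00
print(parse_korean_date('2000년 1월 1일 19:30'))           # 2000-01-01 19:30:00
print(parse_korean_date('1970년 1월 1일'))                 # 1970-01-01 00:00:00
print(parse_korean_date('12월 31일', default_year=2023))   # 2023-12-31 00:00:00
```

This returns None for anything outside those formats, so it can be tried first, with dateparser.parse as the fallback for relative expressions such as '오늘' or '3시간 전'.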

aquatype • Feb 07 '23 05:02