Gavish Poddar

Results 7 issues of Gavish Poddar

minor changes in PUBLISH_DATE_TAGS for extraction of date. supporting date extraction form websites like ndtv, zee news Url to test - "https://www.ndtv.com/world-news/kamala-harris-mention-of-indian-jamaican-parents-in-first-us-election-speech-2278638?pfrom=home-topscroll"

# Reimplementing and simplifying `search_dates` A reimplemented and simplified `search_dates` which more directly uses `dateparser.parse`, improves accuracy and fixes many bugs New Feature: - `search_first_date` - searches and returns the...

This PR solves the below issues, it adds support for date objects like `12.12.2000` (separated by `.`) - #768 - #771 - #500 - #952 - #767 Tests are added...

- Updating CLDR data to `39.0.0`. - Fixing CLDR downlaod error. - Updating CLDR Data URL : `https://github.com/unicode-cldr/cldr-dates-full` (archived) -> `https://github.com/unicode-org/cldr-json`. TODO : - Fixing tests Fixes issue #940

This PR Fixes, - Fixing E402 module-level import, not at top of file in `dateparser/docs/conf.py` - Fixing UnicodeDecodeError: 'charmap' codec can't decode byte 0x8f in position 1442: character maps to...

`search_dates` is having issues with sentence splitting in `_sentence_split` [link](https://github.com/scrapinghub/dateparser/blob/ad50964fc319a26ed69300937354f59332d415a3/dateparser/languages/locale.py#L234) This issue is common in many issues [masked here](https://github.com/scrapinghub/dateparser/pull/945#issuecomment-881762464) referred as related issue 1. Returns broken results limited by period...

CLDR data doesn't seem to be updated and languages like `as` are not translated properly. Additionally, CLDR data URL is archived: `https://github.com/unicode-cldr/cldr-dates-full` (archived) Thanks