newspaper4k
newspaper4k copied to clipboard
Bug fixes and updates to work for ET, TOI
Issue by jasoriya
Fri Apr 13 08:40:43 2018
Originally opened as https://github.com/codelucas/newspaper/pull/552
Hello, I was working on parsing some India newspaper sites like Times of India, Economics Times and others.
- I had to add additional values of attributes for parsing author names.
- For publishedDate, one addition was required in PUBLISH_DATE_TAGS.
- At line 195, I added an additional exception that occurred sometimes while parsing ET articles.
Please let me know any additional details you may require as this is my first pull request.
jasoriya included the following code: https://github.com/codelucas/newspaper/pull/552/commits