Adrian Breiding
Redoes and closes #409 in a clean branch.
As requested by the reviewer in #392, the functionality is split into separate PRs. This PR is based on #403 and should not be merged before it.
If a publisher passes on the language of an article, it should be possible to override the article's auto-generated `lang` attribute. This might make sense in cases...
This adds the first Indian newspaper. Unfortunately, the default two-letter country code (`in`) is a Python keyword, which is why I chose to use the three-letter alternative.
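To illustrate the keyword clash, here is a minimal sketch using only the standard library. The helper `attribute_name_for` is hypothetical and not part of this PR; it only shows why a two-letter code such as `in` cannot be used as an attribute name and how a fallback to the three-letter code might look.

```python
import keyword


def attribute_name_for(alpha2: str, alpha3: str) -> str:
    """Hypothetical helper: pick an identifier for a country code.

    Returns the lowercase two-letter code unless it is a reserved
    Python keyword, in which case the three-letter code is used.
    """
    candidate = alpha2.lower()
    if keyword.iskeyword(candidate):
        return alpha3.lower()
    return candidate


def is_valid_attribute(name: str) -> bool:
    """Check whether `name` can be assigned as a class attribute."""
    try:
        compile(f"class _C:\n    {name} = 1\n", "<check>", "exec")
    except SyntaxError:
        return False
    return True
```

With this sketch, `attribute_name_for("IN", "IND")` falls back to the three-letter code, while a code like `"KR"` is kept as is. Other two-letter codes that are also keywords, such as `is`, would be handled the same way.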
This PR adds `KR`
This PR adds the first Russian newspaper. Note that due to a bug in the native RobotsParser, you need to ignore the robots file in order to crawl this publisher.
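As a rough sketch of what ignoring the robots file means in practice, the following uses Python's standard `urllib.robotparser`. The function `is_allowed` and its `ignore_robots` flag are assumptions made for illustration; they are not the project's actual API, and the sketch does not reproduce the parser bug mentioned above.

```python
from typing import Iterable
from urllib.robotparser import RobotFileParser


def is_allowed(
    url: str,
    robots_lines: Iterable[str],
    user_agent: str = "*",
    ignore_robots: bool = False,
) -> bool:
    """Hypothetical check: may `url` be crawled under the given robots rules?

    When `ignore_robots` is True, the robots rules are skipped entirely
    and every URL is treated as allowed.
    """
    if ignore_robots:
        return True
    parser = RobotFileParser()
    parser.parse(list(robots_lines))  # parse rules from in-memory lines
    return parser.can_fetch(user_agent, url)


# Example rules, made up for this sketch.
EXAMPLE_RULES = ["User-agent: *", "Disallow: /private/"]
```

Passing `ignore_robots=True` bypasses the parser completely, which is the kind of opt-out a crawler would need when the parsed rules cannot be trusted.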