newspaper issues

Results 152 newspaper issues

Sort by recently updated

Add support for Latvian languaga

Stopword list from https://countwordsfree.com/stopwords/latvian

Is there a way to increase read timeout for article download?

Hi, I am attempting to use newspaper to download many articles and do not want the timeout window to be set at 7 seconds. Is there any way either within...

5465869

newspaper.nlp | ignored stopword: using

I noticed that the nlp loading the stopwords and as well as the stopwords-en.txt including 'using'. however i still see n3k returning 'using' as a keyword for the below pages....

ilkut

passing page sourse(html) instead of url

i want to use newspaper lib. but instead of use it by passing url of article i want to to pass article page sourse. Is there any way I can...

akashmondal1810

Does not work with nytimes?

Describe the bug Whenever I tried to extract contents from NYTimes articles, they are random and incomplete. I tried on Newspaper Demo page as well for NYTimes articles and I...

yingyingww

Add Latvian language support

Adding Latvian language support. Tested locally and already started working on a personal project using the newly added language, everything seems to be working as expected for me 👍 -...

kaspars-gailitis

Replace jieba3k with jieba

[jieba3k](https://pypi.python.org/pypi/jieba3k) package is outdated. It is last updated in 2014. Main [jieba](https://pypi.python.org/pypi/jieba) is Python3 compatible since 2015.

pistolero

enhancement

Adding Bengali Language Support

I incorporated the Bengali tokenizer from cltk, and an open source Bengali stopword list, and updated everything per the instructions. I also tested it locally and all seems to work.

lookatmeimdanny

Add article to node search tags

* The tag often denotes exactly where the article begins and ends in HTML5. I noticed the wrong text was being pulled from articles on the bbc.co.uk. This includes whole...

8W9aG

Cannot transverse from clean_top_node to clean_doc or doc

Perhaps misunderstand the relationship from clean_top_node to clean_doc or doc, but cannot transverse from clean_top_node to clean_doc or doc. For example, following will not work. a = Article('https://somesite.com/some_article') a.download() a.parse()...

monstrfolk

newspaper
newspaper copied to clipboard

Metadata

Add support for Latvian languaga

Is there a way to increase read timeout for article download?

newspaper.nlp | ignored stopword: using

passing page sourse(html) instead of url

Does not work with nytimes?

Add Latvian language support

Replace jieba3k with jieba

Adding Bengali Language Support

Add article to node search tags

Cannot transverse from clean_top_node to clean_doc or doc

← Metadata

Owner

Metadata

newspaper newspaper copied to clipboard

Metadata

← Metadata

Owner

Metadata

newspaper
newspaper copied to clipboard