newspaper issues

Results 152 newspaper issues

Sort by recently updated

can't start new thread

Sorry, my English is not good, I will try to be as clear as possible I used 3 servers to run my program, but there are still errors like `error:...

liuying12138

not finding embedded youtube videos

I'm trying to scrape youtube videos from this link (https://lifehacker.com/the-best-diy-youtube-channels-to-turn-you-into-a-fix-it-1699686543), I'm successfully able to get the images, title and text but for some reason, I'm not able to get any...

kaundinya5

fix(sec): upgrade nltk to 3.6.6

### What happened？ There are 1 security vulnerabilities found in nltk 3.2.1 - [MPS-2022-15003](https://www.oscs1024.com/hd/MPS-2022-15003) ### What did I do？ Upgrade nltk from 3.2.1 to 3.6.6 for vulnerability fix ### What...

chncaption

fix(sec): upgrade requests to 2.20

### What happened？ There are 1 security vulnerabilities found in requests 2.10.0 - [CVE-2018-18074](https://www.oscs1024.com/hd/CVE-2018-18074) ### What did I do？ Upgrade requests from 2.10.0 to 2.20 for vulnerability fix ### What...

chncaption

Would not load custom feed articles

I was having difficulting getting articles from a site and noticed that It kept dumping my custom feed extensions. I found that the problem was It was memoizing the feed...

Coinjuice

memoize_articles=False still caches

Setting memoize_articles to False still caches articles. The docs say that setting it to False shouldn't cache anything. This can cause problems when scraping a site such as wayback machine....

N8Brooks

Blogger / Blogspot issue

Some blogspot / blogger sites don't seem to parse: here is an example: `from newspaper import Article url = 'http://www.righto.com/2011/07/cells-are-very-fast-and-crowded-places.html' article = Article(url) article.download() article.parse() print(article.text)` this prints ""

ontopicprojects

newspaper
newspaper copied to clipboard

Metadata

can't start new thread

not finding embedded youtube videos

fix(sec): upgrade nltk to 3.6.6

fix(sec): upgrade requests to 2.20

Would not load custom feed articles

memoize_articles=False still caches

Blogger / Blogspot issue

fix itemprop containing articleBody

ContentExtractor.nodes_to_check doesn't recognize the "right" <p> elements in html article

bengali support added

← Metadata

Owner

Metadata

newspaper newspaper copied to clipboard

Metadata

← Metadata

Owner

Metadata

newspaper
newspaper copied to clipboard