newspaper issues

Results 152 newspaper issues

Sort by recently updated

I get back the same description on all these links, although they are clearly different.

They all return the same description for me when running on a local environment in Windows as well as ec2 linstance in linux. > ["https://fortune.com/2022/05/01/germany-says-dependence-on-russian-oil-could-end-in-late-summer/", > "https://fortune.com/2022/05/01/raytheon-union-reach-labor-deal-at-key-jet-engine-plants/", > "https://fortune.com/2022/05/01/bored-ape-metaverse-frenzy-raises-millions-crashes-ethereum/", >...

gaurav-95

Please make sure your browser supports JavaScript and cookies and that you are not blocking them from loading

Thank you for your awesome work. We have an issue when we try to crawl sites with cookies/javascript alerts. E.g https://www.bloomberg.com/news/articles/2020-08-03/softbank-backed-grab-snags-200-million-from-private-equity-firm We can only get an alternate version of the...

ngocdb

After downloading a few hundred articles it mass fails

So I am using newspaper3k to mass download articles while scraping Google, I noticed that after a couple of hours of downloading hundreds of different articles it continuously gives me...

steeljardas

It turns out that a lot of sites do not work with

I am completely disenchanted. Why these dictionaries, key stop words? From many sites, instead of the text of the article, there is an empty line. I definitely didn't expect this.

alekssamos

Close opened category cache files when done

## This PR Proposes: - Add file.close() to category cache files after use Addresses issue: #843

WillGITCode

404 Client Error

Hi, When I run your example code, why did I get the 404 Client Error which indicates: CRITICAL:newspaper.network:[REQUEST FAILED] 404 Client Error: Not Found for url: http://edition.cnn.com/feeds CRITICAL:newspaper.network:[REQUEST FAILED] 404...

Zhujunnan

enhancement

newspaper.build find 0 articles

url: https://ec.europa.eu/commission/presscorner/home/en obviously there is many news on this web plz help..

saha65536

Should newspaper3k bypass a wall on ft.com or medium.com?

This issue asks about specification of newspaper3k. Some media company page (eg ft.com and medium.com) has a wall. newspaper3k doesn't go beyond. For example, when you parse ```https://www.ft.com/content/2f081189-01dd-4549-a6b0-ab4f04a103cd```, you get...

nwatab

Update in PUBLISH_DATE_TAGS for date extraction

minor changes in PUBLISH_DATE_TAGS for extraction of date. supporting date extraction form websites like ndtv, zee news Url to test - "https://www.ndtv.com/world-news/kamala-harris-mention-of-indian-jamaican-parents-in-first-us-election-speech-2278638?pfrom=home-topscroll"

gavishpoddar

Unable to pull articles from list of article URL's

I have stored a list of previously used articles downloaded from various sites as URL's in a list, trying to iterate through the list to download each one but throws...

Unique201

newspaper
newspaper copied to clipboard

Metadata

I get back the same description on all these links, although they are clearly different.

Please make sure your browser supports JavaScript and cookies and that you are not blocking them from loading

After downloading a few hundred articles it mass fails

It turns out that a lot of sites do not work with

Close opened category cache files when done

404 Client Error

newspaper.build find 0 articles

Should newspaper3k bypass a wall on ft.com or medium.com?

Update in PUBLISH_DATE_TAGS for date extraction

Unable to pull articles from list of article URL's

← Metadata

Owner

Metadata

newspaper newspaper copied to clipboard

Metadata

← Metadata

Owner

Metadata

newspaper
newspaper copied to clipboard