Andrei Paraschiv

Results: 95 issues by Andrei Paraschiv

**Issue by [cepesh](https://github.com/cepesh)** _Sun Oct 8 17:45:03 2017_ _Originally opened as https://github.com/codelucas/newspaper/issues/457_ ---- Hiya, None of the following links are parsed correctly. All I get in .text is "The latest...

sites not working

**Issue by [ZeeshanSultan](https://github.com/ZeeshanSultan)** _Mon May 28 13:26:47 2018_ _Originally opened as https://github.com/codelucas/newspaper/issues/572_ ---- https://github.com/codelucas/newspaper/blob/c521057b20bb3d4cd27d8b0ee6efd64d1d3a488f/newspaper/urls.py#L239 The validator uses blacklist based filters to detect bad urls and then whitelist based filter to...

sites not working

**Issue by [leesamu](https://github.com/leesamu)** _Tue Jul 24 21:57:52 2018_ _Originally opened as https://github.com/codelucas/newspaper/issues/600_ ---- Newspaper has been working better for me than any other substitute I can find. I'm using it...

sites not working

**Issue by [tomthebuzz](https://github.com/tomthebuzz)** _Wed Jul 18 14:43:41 2018_ _Originally opened as https://github.com/codelucas/newspaper/issues/596_ ---- Like your work a lot and would like to expand on it. Did you ever think about...

enhancement

**Issue by [xoffey](https://github.com/xoffey)** _Thu Jul 26 00:07:22 2018_ _Originally opened as https://github.com/codelucas/newspaper/issues/601_ ---- In a dataset that included 986 articles from LA Times, 443 (44.9%) of the LA Times articles...

sites not working

**Issue by [bq-chen](https://github.com/bq-chen)** _Mon Feb 11 18:55:59 2019_ _Originally opened as https://github.com/codelucas/newspaper/issues/676_ ---- I've tried a Chinese news link, but I get nothing as top_image or even the text of...

sites not working

**Issue by [christinac](https://github.com/christinac)** _Sun Feb 12 19:03:22 2017_ _Originally opened as https://github.com/codelucas/newspaper/issues/333_ ---- Right now, an article's `publish_date` returns a datetime object with a month, day, and year. It would...

enhancement

**Issue by [mamoit](https://github.com/mamoit)** _Fri Jul 21 10:37:52 2017_ _Originally opened as https://github.com/codelucas/newspaper/issues/403_ ---- Spec defines it [here](http://ogp.me/#structured). I think we should scrape `og:image:secure_url` and fallback to `og:image:url` if the first...

enhancement
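
The fallback order proposed in the issue above could be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not newspaper's actual extraction code: the helper name `pick_og_image` and the parser class are hypothetical, and the final fallback to plain `og:image` is an added assumption (the Open Graph spec treats `og:image:url` as identical to `og:image`).

```python
from html.parser import HTMLParser


class OGMetaParser(HTMLParser):
    """Collects <meta property="..." content="..."> pairs (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.properties = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr_map = dict(attrs)
        prop = attr_map.get("property")
        content = attr_map.get("content")
        # Keep the first value seen for each property.
        if prop and content and prop not in self.properties:
            self.properties[prop] = content


def pick_og_image(html):
    """Return the preferred Open Graph image URL, or None if absent.

    Preference: og:image:secure_url, then og:image:url, then og:image
    (the last step is an assumption, not part of the issue text).
    """
    parser = OGMetaParser()
    parser.feed(html)
    for key in ("og:image:secure_url", "og:image:url", "og:image"):
        value = parser.properties.get(key)
        if value:
            return value
    return None


if __name__ == "__main__":
    sample = (
        '<meta property="og:image" content="http://example.com/a.png">'
        '<meta property="og:image:secure_url" content="https://example.com/a.png">'
    )
    print(pick_og_image(sample))
```

Checking the properties in a fixed order keeps the preference explicit, so the HTTPS variant wins whenever a page declares it, regardless of tag order.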

**Issue by [somnathrakshit](https://github.com/somnathrakshit)** _Fri Jul 21 14:40:55 2017_ _Originally opened as https://github.com/codelucas/newspaper/issues/405_ ---- The incorrect text is being extracted from these two links at [http://newspaper-demo.herokuapp.com](http://newspaper-demo.herokuapp.com) for these web pages: 1....

sites not working

**Issue by [kaundinya5](https://github.com/kaundinya5)** _Wed Jan 10 10:44:00 2018_ _Originally opened as https://github.com/codelucas/newspaper/issues/503_ ---- Is there a way I can get results from paginated websites?

enhancement