Andrei Paraschiv

Results 154 comments of Andrei Paraschiv

Hi @queukat! Thanks for your contribution! What I wonder is: why create a new class rather than extend the functionality here: https://github.com/AndyTheFactory/newspaper4k/blob/c5e4170918a6d1e99cb1bab6fd188ee8ed5a2afa/newspaper/extractors/metadata_extractor.py#L164-L174

Hi @queukat, thanks for your compelling arguments. You are right, it would make sense to have the tags in their own extractor. Would it not make sense to move the...

**Comment by [johnbumgarner](https://github.com/johnbumgarner)** _Sat Feb 26 05:43:47 2022_ ---- What are some of the sites that aren't extracting for you?

**Comment by [Baytars](https://github.com/Baytars)** _Fri Apr 15 15:33:25 2022_ ---- Pubmed. > What are some of the sites that aren't extracting for you?

**Comment by [tspier](https://github.com/tspier)** _Sun Jul 18 23:15:56 2021_ ---- Maybe one of the two files here? https://github.com/codelucas/newspaper/tree/master/newspaper/resources/misc

Unfortunately, cloudscraper does not work in 100% of cases. In those cases you should use Playwright or something similar. [Here is an example](https://newspaper4k.readthedocs.io/en/latest/user_guide/examples.html#using-playwright-to-scrape-websites-built-with-javascript).
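For reference, here is a rough sketch of that approach, not the exact code from the linked docs. It assumes `playwright` (with a Chromium build installed via `playwright install chromium`) and `newspaper4k` are available. The helper names `render_html`, `parse_with_newspaper` and `parse_rendered` are made up for this illustration. The idea is to let a real browser render the page, then hand the resulting HTML to newspaper through `Article.download(input_html=...)` so that newspaper does not fetch the page itself.

```python
def render_html(url, timeout_ms=30000):
    """Load the page in headless Chromium and return the rendered HTML."""
    # Imported lazily so the module loads even if playwright is missing.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        try:
            page = browser.new_page()
            # Wait until network activity settles so JS-built content is present.
            page.goto(url, timeout=timeout_ms, wait_until="networkidle")
            return page.content()
        finally:
            browser.close()


def parse_with_newspaper(url, html):
    """Parse pre-fetched HTML with newspaper instead of downloading again."""
    from newspaper import Article

    article = Article(url)
    article.download(input_html=html)  # reuse the HTML rendered by the browser
    article.parse()
    return article


def parse_rendered(url, renderer=render_html, parser=parse_with_newspaper):
    """Render the page with `renderer`, then parse the HTML with `parser`."""
    html = renderer(url)
    return parser(url, html)
```

Usage would be something like `article = parse_rendered("https://example.com/some-article")` followed by `article.title` and `article.text` (the URL is a placeholder). The renderer and parser are passed as arguments so you can swap in another browser tool or add site-specific steps, such as clicking a cookie banner, before the HTML is captured.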

**Comment by [naivelogic](https://github.com/naivelogic)** _Mon Aug 27 02:14:17 2018_ ---- Yes, I am currently having the same problem. The only workable solution I could find was to manually go to the...

**Comment by [codelucas](https://github.com/codelucas)** _Mon Aug 27 07:07:48 2018_ ---- Thanks for filing this @tomthebuzz and also @naivelogic. If what you guys are reporting is true then this seems to be...

**Comment by [tomthebuzz](https://github.com/tomthebuzz)** _Mon Aug 27 08:32:27 2018_ ---- Hi Lucas, unfortunately it does not reproduce consistently. While iterating in 10-minute intervals I have 7-8 out of 10 that show this...

**Comment by [codelucas](https://github.com/codelucas)** _Mon Sep 3 06:17:50 2018_ ---- The memoization behavior is beginning to get very annoying since a lot of users are reporting issues with the API out...