Andrei Paraschiv

Results 154 comments of Andrei Paraschiv

Yeah, there is a problem. It seems that **bbc.com** is now just dynamically rendered, there page is constructed with javascript after it loads. Here, you can see that there are...

Hi, this behavior is "as expected", but I agree, it's not optimal. The problem it addresses is that some sites prepend the website name to the title (or postpend) In...

**Comment by [racindustries](https://github.com/racindustries)** _Tue Oct 16 07:45:48 2018_ ---- Hi PrajP, It's only a backup solution of course, but here's how I proceeded to reduce my results to health category...

Hi, can you try out with the new release (0.9.3)? Also, make sure you test with memorize_articles=False? https://newspaper4k.readthedocs.io/en/latest/user_guide/api_reference.html?highlight=memorize_#newspaper.configuration.Configuration.memorize_articles just set it in config, or as a parameter for the Source...

**Comment by [aussetg](https://github.com/aussetg)** _Wed Mar 6 10:29:38 2019_ ---- If you're introducing Spacy as a dependency then might as well replace NLTK with Spacy too. I think I'm going to...

**Comment by [mcpeixoto](https://github.com/mcpeixoto)** _Thu Jun 27 11:31:06 2019_ ---- I'm also looking for that!

**Comment by [congthinh](https://github.com/congthinh)** _Wed May 27 07:57:46 2020_ ---- > Is it possible to create a function within the newspaper API to get a category for a specific article based...

**Comment by [startupflux](https://github.com/startupflux)** _Sat Nov 18 10:28:50 2017_ ---- I see the same issue and not sure how to fix this. newspaper is picking up authors from the 'Read more'...

i think this is related to #639 The pull request by @changchiyou should fix this

**Comment by [mbahmani](https://github.com/mbahmani)** _Tue Apr 20 16:17:44 2021_ ---- This is my question too. also, how we can evaluate the result for the summary and keywords?