python-readability icon indicating copy to clipboard operation
python-readability copied to clipboard

fast python port of arc90's readability tool, updated to match latest readability.js!

Results 43 python-readability issues
Sort by recently updated
recently updated
newest added

```python >>> u = "https://www.geeksforgeeks.org/samsung-research-institute-bangalore-srib-intern/" >>> import requests >>> r = requests.get(u) >>> from readability import Document >>> doc = Document(r.content) >>> doc.get_clean_html() Traceback (most recent call last): File "",...

Summary() seem don't work on website where text is spliting() in many tag . I encoutered this problem specifically on this web site : [https://start.lesechos.fr/actu-entreprises/services/a-19-ans-il-est-le-plus-jeune-patissier-prime-au-guide-michelin-13983.php](https://start.lesechos.fr/actu-entreprises/services/a-19-ans-il-est-le-plus-jeune-patissier-prime-au-guide-michelin-13983.php) ```python url = "https://start.lesechos.fr/actu-entreprises/services/a-19-ans-il-est-le-plus-jeune-patissier-prime-au-guide-michelin-13983.php" page...

The summary method removes non breaking space (` `) instead of inserting a simple space or keeping it. Example: ``` Dann haben wir ein unschlagbares Angebot für Sie! ``` gets to ``` DannhabenwireinunschlagbaresAngebotfürSie! ``` Full html file is...

http://www.chinanews.com/ga/2016/11-14/8062393.shtml,http://ent.sina.com.cn/s/m/2016-11-14/doc-ifxxsmic6238512.shtml?qq-pf-to=pcqq.c2c

Thank you for keeping up the project! I use readability to extract the article and then save it as html. Today I've run into problem when Chrome didn't display some...

Hi @buriy, do you wake up? It seems I always post when you go to sleep, the dammit timezone... Ok, I come up a thought : List all node absolute...

Hi there, I have a request that shouldn't be too hard to make, though might require some trickery for a proper full CSS solution. If you look at the code...

- Fixes support for `lxml>=5.2.0` - Adds explicit support for current python versions - Removes support for EOL'd python versions - Removes support for EOL'd platforms - Fixes build for...