html2csv
A utility that extracts tables from HTML documents and converts them to CSV format
Unlike pathlib, BeautifulSoup can guess and handle several text encodings, so we let it work its magic. Addresses issue #5.
This has been pending with the `convert_to_list` function added in #10: https://github.com/hanwentao/html2csv/compare/v0.1.3...master
It would be handy if we could pass the `csv` quoting values into this tool (see https://docs.python.org/3/library/csv.html#csv.QUOTE_ALL). I think you could do something similar for setting the delimiter too (which you already...
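A minimal sketch of what this request could look like, using only Python's standard `csv` module. The `rows_to_csv` helper and its parameters are hypothetical and are not part of html2csv; they only illustrate forwarding `quoting` and `delimiter` to `csv.writer`.

```python
import csv
import io


def rows_to_csv(rows, quoting=csv.QUOTE_MINIMAL, delimiter=","):
    """Serialize rows to CSV text (hypothetical helper, not html2csv's API)."""
    buffer = io.StringIO()
    # csv.writer accepts both options as formatting parameters.
    writer = csv.writer(
        buffer, quoting=quoting, delimiter=delimiter, lineterminator="\n"
    )
    writer.writerows(rows)
    return buffer.getvalue()


rows = [["name", "qty"], ["apple", 3]]

# Quote every field, as csv.QUOTE_ALL requests.
quoted = rows_to_csv(rows, quoting=csv.QUOTE_ALL)

# Use a tab as the field separator instead of a comma.
tabbed = rows_to_csv(rows, delimiter="\t")
```

A command-line tool could expose the same two options as flags and pass them straight through, which keeps the behavior identical to the standard library's.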
Support CSS selector to pick tables out of HTML documents.
Interestingly enough, someone found a [security issue](http://www.antihackingonline.com/potential-risk-of-cve/cve-2021-23654-this-affects-all-versions-of-package-html-to-csv-the-flaw-let-threat-actor-can-embed-or-generate-a-malicious-link-or-execute-commands-via-csv-files-26-11-2021/) in this code but failed to post a pull request or even report it: https://security.snyk.io/vuln/SNYK-PYTHON-HTMLTOCSV-1582784
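Assuming the report concerns the formula-injection class of problem that the link title describes (links or commands embedded in CSV cells), one commonly suggested mitigation is to neutralize cells that a spreadsheet would interpret as formulas. The sketch below is illustrative only: `neutralize_cell` and `FORMULA_PREFIXES` are hypothetical names, and this is not how html2csv itself handles the issue.

```python
# Leading characters that spreadsheet applications may treat as the start
# of a formula (an assumption based on common CSV-injection guidance).
FORMULA_PREFIXES = ("=", "+", "-", "@", "\t", "\r")


def neutralize_cell(value):
    """Prefix risky cells with a single quote so they are read as text."""
    text = str(value)
    if text.startswith(FORMULA_PREFIXES):
        return "'" + text
    return text


row = ['=HYPERLINK("http://example.com")', "plain text", "-1"]
safe_row = [neutralize_cell(cell) for cell in row]
```

Note the trade-off: legitimate values such as negative numbers are also prefixed, so a real fix would probably make this behavior optional.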
Support other delimiters like tabs, spaces, etc. Issue reported by Nicolás Gaete Tan by email.
I am converting a webpage that is not UTF-8 encoded. There's no way to specify an input encoding to your tool, so only UTF-8 is permissible. Most of my favorite webpage...
Hi, love this project; it is very handy. I noticed that the tool skips table rows that contain empty cells. Any way we can remedy...