html-table-extractor
Extract data from an HTML table
I have encountered multiple bugs caused by maintaining backward compatibility with Python 2, and it makes development extremely hard since I have to hack the code to make it work for...
Encoding
Gives an error stating 'ascii' codec can't encode character u'\xe9'
It would be a good feature to extract the rows as a dict, at least from simple tables. Example: | A | B | C | | 1 | 2...
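A minimal sketch of what this request could look like for a simple table, assuming the rows are already available as a list of lists with the header in the first row. The helper name `rows_to_dicts` and the sample values are hypothetical and are not part of the package:

```python
def rows_to_dicts(rows):
    """Use the first row as the header and map every later row onto it.

    Hypothetical helper for illustration; only suitable for simple tables
    without row or column spans.
    """
    if not rows:
        return []
    header, *body = rows
    # zip() pairs each header cell with the cell in the same column.
    return [dict(zip(header, row)) for row in body]


# Illustrative data shaped like the example in the request.
table = [["A", "B", "C"], ["1", "2", "3"]]
print(rows_to_dicts(table))  # [{'A': '1', 'B': '2', 'C': '3'}]
```

Note that `zip()` silently drops extra cells when a row is longer than the header, so irregular tables would need additional handling.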
I am not sure if this is really an issue with the parser but perhaps an improvement request unless a solution is available when using this parser. Consider a complex...
Your package works great, but I had to modify it slightly. ``` self._insert(row_ind, col_ind, row_span, col_span, self._transformer(cell.get_text())) ``` This is fine if the content is text but if it contains...
Hi, it would be nice if you supported the parameters of the BeautifulSoup get_text method, namely 'separator' and 'strip'. See the BS docs here: https://beautiful-soup-4.readthedocs.io/en/latest/index.html?highlight=get_text#get-text These could be added...
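For reference, a short sketch of how those two BeautifulSoup parameters change the text extracted from a cell. The wrapper `cell_text` and the sample HTML are assumptions made for illustration; how the package would actually expose these options is not stated here:

```python
from bs4 import BeautifulSoup


def cell_text(cell, separator="", strip=False):
    """Hypothetical wrapper that forwards both options to get_text()."""
    return cell.get_text(separator=separator, strip=strip)


# Illustrative cell with a line break and nested markup.
html = "<td>first line<br/>second line <b> bold </b></td>"
cell = BeautifulSoup(html, "html.parser").td

# Defaults: text fragments are concatenated with no separator.
default_text = cell_text(cell)

# With a separator and strip=True, each fragment is stripped and then joined.
joined_text = cell_text(cell, separator=" ", strip=True)
print(joined_text)  # first line second line bold
```

The defaults mirror those of `get_text()`, so existing callers would see no change unless they opt in.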