visidata
[html] Opening up an html table with interspersed headers
<Notkea>
hello, I'm trying to extract data from an HTML table which does not have a header row. I end up with a single column and many empty rows (containing NoneType objects). Any hint of how I could get the data in the cells? The document looks like this: vd "https://webshop.calestor-periway.fr/product/Moniteurs-TV/Moniteurs/Samsung/Samsung-C49J890DKR-cran-LED-incurv-49-?searchtrack=ProductList&prodid=1437755&info=2"
--header 0 does not seem to help. Opening an issue to investigate when we have more focused time. The question was originally asked on #visidata.
The table structure is <tr> rows alternately containing <th>/<td> tags. The html loader will have to do something different in this particular case.
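As a rough illustration of what "something different" could look like, here is a minimal standalone sketch using only Python's standard library. It is not VisiData's html loader and does not use VisiData's API. It assumes one possible reading of the structure, namely that each <tr> holds <th> label cells each followed by a <td> value cell; the names KeyValueTableParser and extract_pairs and the sample markup are hypothetical and not taken from the page in question.

```python
from html.parser import HTMLParser


class KeyValueTableParser(HTMLParser):
    """Collect the text of <th>/<td> cells, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []     # one list per <tr>, holding (tag, text) tuples
        self._cell = None  # tag of the cell currently being read, if any
        self._text = []    # text fragments of the current cell

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows.append([])
        elif tag in ("th", "td") and self.rows:
            self._cell = tag
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside a <th> or <td>.
        if self._cell is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == self._cell:
            self.rows[-1].append((tag, "".join(self._text).strip()))
            self._cell = None


def extract_pairs(html):
    """Return (label, value) pairs: each <th> paired with the next <td> in its row."""
    parser = KeyValueTableParser()
    parser.feed(html)
    parser.close()
    pairs = []
    for row in parser.rows:
        label = None
        for tag, text in row:
            if tag == "th":
                label = text
            elif label is not None:
                pairs.append((label, text))
                label = None
    return pairs


# Hypothetical sample markup; the real page may be structured differently.
sample = """
<table>
  <tr><th>Brand</th><td>Samsung</td><th>Size</th><td>49 inches</td></tr>
  <tr><th>Panel</th><td>LED</td></tr>
</table>
"""
print(extract_pairs(sample))
```

The resulting pairs could then be written out as a two-column CSV and opened with vd. A <td> that has no preceding <th> in its row is skipped in this sketch; a real loader would need to decide how to handle such cells.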