warn-scraper
WI scraper text parsing weirdness
Some junky HTML is coming in with a United States Cellular Corporation entry, but if I try to replace the text or split on it within _clean_text, it fails. If I even just try to log lines containing "Cellular" or "Corporation", I don't see them. I don't know if there's a Unicode vs. ASCII issue or something else going on here. The actual CSV output has that cell wrapped in regular quote marks, though the HTML inside is unescaped and contains several quote marks of its own.
I tried.
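One way to test the Unicode theory is to dump every character in the cell that isn't plain printable ASCII, then retry the match on a normalized copy. The sketch below is only a diagnostic under assumptions: the helper names describe_odd_chars and normalize_for_matching are hypothetical (they are not part of warn-scraper), and the sample string is invented to show the technique, not taken from the real WI page.

```python
import unicodedata


def describe_odd_chars(text):
    """List (index, repr, Unicode name) for each char outside printable ASCII."""
    odd = []
    for i, ch in enumerate(text):
        if not (32 <= ord(ch) < 127):
            odd.append((i, repr(ch), unicodedata.name(ch, "UNKNOWN")))
    return odd


def normalize_for_matching(text):
    """Return a copy of text that is safer to search with plain ASCII substrings."""
    # NFKC folds compatibility characters, e.g. a no-break space becomes a space.
    folded = unicodedata.normalize("NFKC", text)
    # Drop invisible format characters (category Cf), e.g. zero-width space.
    folded = "".join(ch for ch in folded if unicodedata.category(ch) != "Cf")
    # Collapse any run of whitespace into a single space.
    return " ".join(folded.split())


# Hypothetical cell content: a no-break space and a zero-width space hidden inside.
sample = "United\u00a0States Cel\u200blular Corporation"

print("Cellular" in sample)                          # plain search on the raw text
print(describe_odd_chars(sample))                    # what is actually hiding in there
print("Cellular" in normalize_for_matching(sample))  # search after normalizing
```

If describe_odd_chars comes back empty for the real cell, the Unicode theory is probably wrong, and the next thing to check would be whether the text ever reaches _clean_text as a single string at all.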