warn-scraper icon indicating copy to clipboard operation
warn-scraper copied to clipboard

WI scraper text parsing weirdness

Open stucka opened this issue 5 months ago • 0 comments

Some junky HTML is coming in with a United States Cellular Corporation entry; but if I try to replace the text or split on it within _clean_text, it fails. If I try to even just log lines with "Cellular" or "Corporation" I don't see them. I don't know if there's a Unicode vs. ASCII thing or something cooking here, the actual CSV output has that cell wrapped in regular quote marks, though the HTML inside is unescaped and contains several quote marks.

I tried.

stucka avatar Jul 16 '25 01:07 stucka