warn-scraper icon indicating copy to clipboard operation
warn-scraper copied to clipboard

Incomplete data coming through

Open stucka opened this issue 2 years ago • 1 comments

Bot sent through something with no state postal code, even CHARTER COMMUNICATIONS LLC - CENTRAL REGION Notice date: 2023-07-26 FALLON HEALTH WEINBERG, INC. - WESTERN REGION Notice date: 2023-07-20

stucka avatar Aug 15 '23 20:08 stucka

The initial problem is likely one with warn-bot, but it highlighted another.

New York's scraper is combining fields, such as Charter Communications LLC - Central Region

Company name can be separated as " - ".join(companyname.split(" - ")[:-1]

Region name can be separated as companyname.split(" - ")[-1] or companyname.split(" - ")[-1].replace(" Region", "")

Not sure if that would duplicate data n the database as the company name would shift considerably.

stucka avatar Aug 16 '23 10:08 stucka