warn-scraper
warn-scraper copied to clipboard
Incomplete data coming through
Bot sent through something with no state postal code, even CHARTER COMMUNICATIONS LLC - CENTRAL REGION Notice date: 2023-07-26 FALLON HEALTH WEINBERG, INC. - WESTERN REGION Notice date: 2023-07-20
The initial problem is likely one with warn-bot, but it highlighted another.
New York's scraper is combining fields, such as Charter Communications LLC - Central Region
Company name can be separated as " - ".join(companyname.split(" - ")[:-1]
Region name can be separated as companyname.split(" - ")[-1] or companyname.split(" - ")[-1].replace(" Region", "")
Not sure if that would duplicate data n the database as the company name would shift considerably.