warn-scraper
Command-line interface for downloading WARN Act notices of qualified plant closings and mass layoffs from state government websites
I have this working and will put up a PR shortly. The PDFs for NY are all well-structured, and this information is fairly easy to pull out with regexes.
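As a rough illustration of the approach, here is a minimal sketch of pulling labeled fields out of notice text with regexes. It assumes the PDF text has already been extracted (for example with pdfplumber), and the field labels and patterns are illustrative placeholders, not the actual NY notice layout.

```python
import re

# Hypothetical field patterns; the real NY notices may label fields differently.
FIELDS = {
    "company": re.compile(r"Company:\s*(.+)"),
    "county": re.compile(r"County:\s*(.+)"),
    "employees_affected": re.compile(r"Number Affected:\s*(\d+)"),
    "notice_date": re.compile(r"Date of Notice:\s*([\d/]+)"),
}

def parse_notice(text: str) -> dict:
    """Return whichever labeled fields can be found in the extracted notice text."""
    row = {}
    for name, pattern in FIELDS.items():
        match = pattern.search(text)
        if match:
            row[name] = match.group(1).strip()
    return row
```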
Bumps [pillow](https://github.com/python-pillow/Pillow) from 10.2.0 to 10.3.0. Release notes sourced from pillow's releases: 10.3.0 (https://pillow.readthedocs.io/en/stable/releasenotes/10.3.0.html). Changes:
- CVE-2024-28219: Use strncpy to avoid buffer overflow #7928 [@hugovk]
- Use functools.lru_cache for hopper() #7912 [@hugovk]
- ...
Back when this was started, there seemed to be a dearth of data about layoffs. Today, there are several websites with layoff data, some automated with "AI," some based on...
> See the [Job Center docs](https://github.com/biglocalnews/WARN/docs/job_center.md) for background on the scraping strategy and issues described below. After cutting over to use the Job Center site class for AZ, DE, KS...
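The linked docs describe the actual implementation; as a rough sketch of the pattern, a shared site class lets every state on the same Job Center platform reuse one scraper and supply only its own state code and base URL. The class, method names, and URLs below are hypothetical placeholders, not the repo's API.

```python
from dataclasses import dataclass

@dataclass
class JobCenterSite:
    state: str
    base_url: str  # the state's Job Center WARN lookup page (placeholder)

    def page_url(self, page: int) -> str:
        """Build the URL for one page of the paginated WARN listing."""
        return f"{self.base_url}?page={page}"

    def scrape(self, pages: int = 1) -> list[dict]:
        """Fetch and parse each listing page; parsing logic is shared across states."""
        rows: list[dict] = []
        for page in range(1, pages + 1):
            url = self.page_url(page)
            # fetch url, parse the results table, append one dict per notice
        return rows

# The same class is reused for every Job Center state:
SITES = {
    "az": JobCenterSite("az", "https://example-az.gov/warn_lookups"),
    "ks": JobCenterSite("ks", "https://example-ks.gov/warn_lookups"),
}
```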
Closes #642
Closes #644
Idaho moved its WARN PDF from `https://www.labor.idaho.gov/dnn/Portals/0/Publications/WARNNotice.pdf` to `https://www.labor.idaho.gov/wp-content/uploads/publications/WARNNotice.pdf`. The scraper follows the redirect transparently, so nothing breaks, but it seems like good policy to update the URL to reflect...
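A quick way to see the redirect in action, assuming the fetch goes through requests (shown only to illustrate the behavior, not the scraper's actual code path):

```python
import requests

# The old URL still resolves; requests follows the redirect by default,
# and response.url shows where it landed.
old_url = "https://www.labor.idaho.gov/dnn/Portals/0/Publications/WARNNotice.pdf"
response = requests.get(old_url)
response.raise_for_status()
print(response.url)                     # final URL after any redirects
print(len(response.history), "redirect(s) followed")
```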
One of the project's dependencies, `tenacity`, handles retrying, but only for one state: Florida. Most retries are handled by the `retry` package as called by `utils.get_url`. In the interest of...
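For context, the `retry` package works as a decorator, roughly as in this sketch of a `get_url`-style helper; the tries/delay/backoff values are illustrative, not the project's actual settings.

```python
import requests
from retry import retry

# Retry on request errors with exponential backoff (example parameters only).
@retry(requests.RequestException, tries=3, delay=2, backoff=2)
def get_url(url: str, **kwargs) -> requests.Response:
    """Fetch a URL, raising (and retrying) on HTTP or connection errors."""
    response = requests.get(url, **kwargs)
    response.raise_for_status()
    return response
```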
This removes some code that is commented out, and some code that no longer functions because there is no longer a link for 2014. Partially addresses #633