juriscraper
juriscraper copied to clipboard
Add scrapers for Texas court data
Adds scrapers to handle Texas court data along with a Scraper ABC, which future scrapers should inherit from. Uses TypedDicts to ensure output of Texas scrapers is always in the appropriate schema.
Resolves #1680
Changes
- Add
ScraperABC, which scrapers should inherit from, in order to help with type hints and ensure that all scrapers have necessary methods implemented. - Add
juriscraper.statesubmodule to organize Texas scrapers and future state scrapers. - Add
TexasSupremeCourtScraper,TexasCourtOfCriminalAppealsScraper, andTexasCourtOfAppealsScraperscrapers to handle their respective docket data, along withTypedDicts to define output schema for each scraper. - Add
TexasCommonScraperto contain extraction functionality common to all three courts. - Add basic
parse_tablemethod tohtml_utils.pyto simplify extraction of tabular data from HTML documents.