juriscraper icon indicating copy to clipboard operation
juriscraper copied to clipboard

Enhance Wisconsin Supreme Court scraper

Open flooie opened this issue 2 years ago • 2 comments

Wisconsin scraper is not down.

It must have been a one off event. But in taking a second look - we should add PDF parsing to the scraped opinions as citation information .. and a neutral citation to boot is embedded at the top of the PDF.

flooie avatar Dec 14 '23 17:12 flooie

We should add the following method to extract out citations in both Wisconsin Supreme and Court of Appeals

def extract_from_text(self, scraped_text: str) -> Dict[str, Any]:

flooie avatar Dec 15 '23 18:12 flooie

Wisconsin should be converted from OpinionSite to OpinionSiteLinear Additionally, all PDFs appear to have a nice and neat neutral citation at the start of the document.

Image

@grossir - this is a good opportunity to update this scraper and bring around a nice addition. Citations are super important, but not all courts produce a citation with their opinion. Often they are generated by for-profit companies months or even years later.

Whenever we have the opportunity we should grab an official citation.

flooie avatar Dec 28 '23 01:12 flooie