
feat(Alabama): Resurrect Alabama Supremes

Open flooie opened this issue 4 years ago • 3 comments

Draft PR for establishing PDF parsing

And the resurrection of the Alabama Supremes.
The parsing is simple enough, and the PDFs are standardized enough, that some simple logic can parse them all.

But I've not added to abstract before and I wanted to get a conversation going.

flooie avatar Dec 20 '21 02:12 flooie

@mlissner As a son of the state of Alabama, I think it's time we brought it back into the CourtListener fold.

I used pdfplumber, and I think it works nicely enough, although I'm sure this may need a broader discussion with @arderyp, @johnhawkinson, and @satsuki-chan.

I thought I would at least kick it back off. Given the number of courts that do this, it's probably time to add PDF parsing and some testing infrastructure.

flooie avatar Dec 20 '21 14:12 flooie

This looks clean to me. My only suggestion is perhaps more comments to explain some of the more cryptic PDF looping stuff.

Let's hope they have some consistent, automated way of producing these PDFs. I've generally had a rough time parsing human-generated PDFs online, but we might as well give it a go!

arderyp avatar Dec 20 '21 16:12 arderyp

@arderyp I took the general idea and put it to work in Colorado. I think it translates nicely.

flooie avatar Dec 20 '21 16:12 flooie