reporters-db
reporters-db copied to clipboard
Statutory and regulatory citations from the Indigo Book
Related to #13 about adding law review abbreviations, what do you think of adding all of the non-case citation formats (statutes and regulations) from the Indigo Book's tables? (Or from another source if there's something easier to extract data from than that, though it doesn't look too bad.)
With the new regexes
field and with eyecite where it is, I think there's some low hanging fruit at this point to expand what we're extracting. We could get fancy, but it could be as easy as adding some new "cite_type": "statute|law_review|..."
values to reporters.json, dropping in regexes extracted from Indigo Book, and having eyecite inspect cite_type
to emit StatuteCitation
or RegulationCitation
or LawReviewCitation
instead of FullCaseCitation
.
"reporters-db" starts to become a little inaccurate at this point, but not sure if that's really an issue ...
We've now added T1. Federal Judicial and Legislative Materials in #56, and T3. U.S. States and Other Jurisdictions in #51.
What's missing still is T2. Federal Administrative and Executive Materials. There's a lot of interesting stuff in there, but with so many edge cases I'm not yet picking it up for the moment. For example:
- There's some stuff that wants to go in reporters.json, like "55 Agric Dec. 1121".
- There's some stuff that wants to go in laws.json like "Treas. Reg. § 1.41-2 (1989)" and "3 C.F.R. 216" and "59 FR 9061".
- There's some stuff that I'm not sure if it counts as citations or just docket numbers, like "ASBCA No. 43466, 93-1 BCA ¶ 25,508". (Should eyecite extract docket numbers?) Anything with
<secondary source if available>
probably falls into this category. - There's some specialty collections like trademark, patent, and copyright registrations that aren't exactly legal sources but would be interesting to include somewhere.
A lot of that is obviously important like C.F.R. and Fed. Reg.
In addition to T2, there are some other sections that still look valuable to mine for sources, with their own edge cases:
- "T4.2. Service Abbreviations", like "Am. Stock Ex. Guide (CCH)", "Admin. L.3d (BNA)". We have some of these in reporters.json already.
- "T6. Required Abbreviations for Treaty Sources", like "T.I.A.S. No. x", "
U.S.T. xxx" ... - "T7. Required Abbreviations for Arbitral Reporters"
- "T8.1. United Nations and League of Nations"
- "T8.2. Europe" (not sure how much it makes sense to pull this thread, vs. non-European sources)
I would be happy to take a stab at adding treaties (T6). Are people ok with them going in laws.json?
PR #121 adds treaties. I am planning to do arbitral reporters and state constitutions next.