Mark up ids of companies named in cases
Lots of cases mention corporations in their title, and perhaps in their body.
Sainsbury's Supermarkets Ltd, R (on the application of) v Wolverhampton City Council & Anor [2010] UKSC 20 (12 May 2010)
Gold Group Properties Ltd v BDW Trading Ltd [2010] EWHC 1632 (TCC) (01 July 2010)
We should get together with OpenCorporates to name-match those, so that you can easily find a list of cases about one company and, going the other way, hyperlink from a case to more info about the company.
I've got some code for (reasonably) efficiently searching for many, many strings simultaneously; I'm currently testing it out on finding citations to judgments inside other judgments.
The same approach may work for matching company names.
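The code referred to above isn't shown in this thread, so the following is only a sketch of one well-known way to search for many strings in a single pass: an Aho-Corasick automaton. It is an assumption that something along these lines would suit the job; the function names `build_automaton` and `find_all` are made up for illustration, and matching is exact and case-sensitive.

```python
from collections import deque


def build_automaton(patterns):
    """Build an Aho-Corasick automaton (hypothetical helper, not the thread's code)."""
    goto = [{}]   # goto[node][char] -> child node index
    fail = [0]    # fail[node] -> node to fall back to on a mismatch
    out = [[]]    # out[node] -> patterns that end at this node

    # Phase 1: insert every pattern into a trie.
    for pat in patterns:
        node = 0
        for ch in pat:
            nxt = goto[node].get(ch)
            if nxt is None:
                nxt = len(goto)
                goto[node][ch] = nxt
                goto.append({})
                fail.append(0)
                out.append([])
            node = nxt
        out[node].append(pat)

    # Phase 2: breadth-first pass to fill in the failure links.
    queue = deque(goto[0].values())
    while queue:
        node = queue.popleft()
        for ch, child in goto[node].items():
            f = fail[node]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[child] = goto[f].get(ch, 0)
            # Inherit matches that end at the fallback node.
            out[child] = out[child] + out[fail[child]]
            queue.append(child)
    return goto, fail, out


def find_all(text, automaton):
    """Return (start_offset, pattern) for every occurrence of any pattern."""
    goto, fail, out = automaton
    node = 0
    hits = []
    for i, ch in enumerate(text):
        while node and ch not in goto[node]:
            node = fail[node]
        node = goto[node].get(ch, 0)
        for pat in out[node]:
            hits.append((i - len(pat) + 1, pat))
    return hits


# Example using a case title quoted earlier in this thread.
names = ["Gold Group Properties Ltd", "BDW Trading Ltd"]
title = "Gold Group Properties Ltd v BDW Trading Ltd [2010] EWHC 1632 (TCC)"
matches = find_all(title, build_automaton(names))
```

The text is scanned once however many names are loaded, which is what makes searching for millions of strings plausible. Real company names would probably need normalising first (case, "Ltd" vs "Limited", punctuation), and that is not handled here.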
In order to attempt this, it would be easiest if we had a sqlite .db file containing a table of company names and OpenCorporates URLs. Do you think you could obtain one (or obtain something that could easily be converted to one)?
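To make the request concrete, here is a rough sketch of what such a .db file might contain and how it could be read back. The table name `companies`, the column names, and the helper functions are all assumptions rather than an agreed format, and the URLs are placeholders, not real OpenCorporates links.

```python
import sqlite3


def create_company_db(path, rows):
    """Create (or open) a database with a hypothetical `companies` table and add rows."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS companies ("
        "name TEXT NOT NULL, "
        "opencorporates_url TEXT NOT NULL)"
    )
    conn.executemany(
        "INSERT INTO companies (name, opencorporates_url) VALUES (?, ?)",
        rows,
    )
    conn.commit()
    return conn


def load_name_index(conn):
    """Return a dict mapping company name -> URL, ready to feed to a string matcher."""
    return dict(conn.execute("SELECT name, opencorporates_url FROM companies"))


# Placeholder rows: the names come from this thread, the URLs are invented stand-ins.
sample_rows = [
    ("Gold Group Properties Ltd", "https://example.org/placeholder/1"),
    ("BDW Trading Ltd", "https://example.org/placeholder/2"),
]
conn = create_company_db(":memory:", sample_rows)
name_index = load_name_index(conn)
```

Anything that can be dumped as two columns (a CSV, say) could be loaded into this shape with a few lines, so the exact delivery format matters less than having the name and URL side by side.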
I should make it clear I'm now completely confident that we can search through the archive for very large numbers of strings (presumably even in the millions) simultaneously in an acceptable time period: this project is now technically easy if we can get the data.
If not, my Companies House scraper can get that for you very easily...