Kevin Ramirez
While running the Columbia merger, we found citations whose volume also contains letters, so the current citation model cannot handle that volume number since the field is defined as...
This PR contains the updated version of the Columbia importer. It includes many changes, such as: - Update the codebase to match Python 3.11 style - Replace deprecated functions - Add typing -...
This management command will allow us to correctly order the opinions from Columbia or Harvard. For Harvard it's not that hard because those opinions were imported in the correct order,...
The citation merger for the Westlaw data is ready. We don't have much information in the dataset, so this was the best way I could do it without messing with very...
The code has comments explaining what each part does. How to run the command: `docker-compose -f docker/courtlistener/docker-compose.yml exec cl-django python manage.py lexis_citation_merger --csv /opt/courtlistener/cl/citations/management/commands/lexis_citation_merger.csv` or `docker-compose -f...
This is the model for storing stub cases. It's very simple: all fields are optional because we don't always have the same information (as with the Westlaw and Lexis datasets)...
While analyzing the data from the Columbia dataset, I found that some courts in the files did not completely match the data we have in CourtListener. I think this happened...
While working with the bulk data, I noticed that docket numbers contain em dashes, or em dashes with additional spaces, instead of plain dashes. I think it would be useful to...
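A minimal sketch of the kind of docket-number cleanup this item describes, assuming the goal is to turn every em dash (with any spaces around it) into a plain hyphen. The helper name `normalize_docket_number` and the exact rule are hypothetical; they are not taken from the actual change.

```python
import re

# Hypothetical helper: the name and the exact normalization rule are
# assumptions made for illustration only.
# Matches an em dash (U+2014) together with any whitespace around it.
_EM_DASH_RE = re.compile(r"\s*\u2014\s*")


def normalize_docket_number(docket_number: str) -> str:
    """Replace each em dash, plus surrounding spaces, with a plain hyphen."""
    return _EM_DASH_RE.sub("-", docket_number).strip()
```

For example, `normalize_docket_number("12 \u2014 345")` and `normalize_docket_number("12\u2014345")` both return `"12-345"`, while a value that already uses plain hyphens is left unchanged.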
I added two useful commands to the Makefile to generate and apply migrations manually. I got to this because, when implementing the new tracker in PR #85, I...
As I mentioned in previous meetings, I think it is easier to match the citations against the datasets we have in the local environment and only generate a...