Rossi
Rossi
This is related to a `Opinion.local_path` correction process. The example ids are actually in the logs in this issue https://github.com/freelawproject/courtlistener/issues/3811#issuecomment-3548518189 @quevon24 can you check these ?
I think looking at a more normalized shape of the Opinion data may help deciding which shape we should use for this feature. However, the fully normalized shape may be...
I think it looks good. This would take advantage of Django content types framework and GenericForeignKeys to use a single table for all LLM stuff for all models that may...
You are right, if there are more types of prompts its better to keep the model as you first proposed it > Do you think it's worth creating a new...
`\d+ search_opinion` on the docker compose DB returns ``` Indexes: "search_opinion_pkey" PRIMARY KEY, btree (id) "search_opinion_author_id_69e3caa8" btree (author_id) "search_opinion_cluster_id_09bd537a" btree (cluster_id) "search_opinion_date_created_76a4ddf9" btree (date_created) "search_opinion_date_modified_524fb7ff" btree (date_modified) "search_opinion_download_url_8428ad91" btree (download_url)...
We can re-use code from opinion versioning to delete same hash duplicates - without creating stale links, via ClusterRedirection - merging different relevant metadata field, like "blocked" status This queryset...
Ran ``` ./manage.py delete_duplicates same_hash --verbosity 3 ``` Output ``` {'same cluster': 47, 'same docket': 20101, 'deleted opinion': 40445, 'deleted cluster': 40398, 'deleted docket': 20344, 'not comparable docket': 2794, 'merging...
Looking at the Columbia archive duplicates `source = 'Z'`, most of them come in pairs, separated by a few miliseconds. Maybe an error on the import code? They seem pretty...
Checking the Lawbox duplicates `source = 'L'` - not timestamp clustered - probably running the import twice, without checking for duplication ```sql courtlistener=> select a.date_created, case_name, a.id opinion_id, source, sha1,...
Only 776 hashes have more than 1 source; ```sql select sha1, count(distinct(source)) from search_opinioncluster a inner join ( select cluster_id, sha1 from search_opinion where sha1 in ( select sha1 from...