goci

Permanent fix for duplication of merged SNPs on import

Open ljwh2 opened this issue 1 year ago • 2 comments

Duplicate SNPs break the data release and cause problems in data presentation; see https://github.com/EBISPOT/goci/issues/1275 for more details. Ala cleaned up all the existing examples in the database, but code changes are required to stop this from happening in future.

Acceptance criteria for this issue:

  • Curators can extract merged SNPs using the merged rsID
  • Unpublished SNPs do not break the data release (DR)
  • Published SNPs appear only once in the UI and the download
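One way to read these criteria is sketched below in Python. This is a hypothetical illustration, not the goci import code: the `SnpRecord` type, the `merged_into` mapping (merged rsID to current rsID) and the function names are all assumptions made for the example. It collapses records onto their current rsID so each SNP is stored once, and keeps an index so a lookup by the merged rsID still works.

```python
from dataclasses import dataclass


@dataclass
class SnpRecord:
    # Hypothetical minimal record; the real schema has many more fields.
    rs_id: str
    published: bool


def resolve(rs_id, merged_into):
    """Follow merged -> current links until an unmerged rsID is reached."""
    seen = set()
    # The `seen` set guards against a cyclic mapping looping forever.
    while rs_id in merged_into and rs_id not in seen:
        seen.add(rs_id)
        rs_id = merged_into[rs_id]
    return rs_id


def import_snps(records, merged_into):
    """Return (one record per current rsID, index of merged rsID -> current rsID)."""
    by_current = {}
    merged_index = {}
    for rec in records:
        current = resolve(rec.rs_id, merged_into)
        if current != rec.rs_id:
            merged_index[rec.rs_id] = current
        existing = by_current.get(current)
        if existing is None:
            by_current[current] = SnpRecord(current, rec.published)
        else:
            # Keep a single record; it counts as published if any duplicate was.
            existing.published = existing.published or rec.published
    return by_current, merged_index


def find_snp(rs_id, by_current, merged_index):
    """Look up a SNP by either its current or its merged rsID."""
    return by_current.get(merged_index.get(rs_id, rs_id))
```

With this shape, an unpublished record that carries a merged rsID is folded into the existing record instead of creating a second row, which is the situation the criteria describe.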

ljwh2 Oct 02 '24 18:10

On further analysis, this may be a problem on the Ensembl side, as none of the duplicated SNPs are found in Ensembl (though they are found in dbSNP). This issue needs a bioinformatician to look at it.

ljwh2 Oct 02 '24 18:10

Moving to the icebox, as no further examples have been encountered.

ljwh2 Jan 07 '25 14:01