uta icon indicating copy to clipboard operation
uta copied to clipboard

Erroneous/duplicative HGNC names `ZHX1-C8ORF76` and `ZHX1-C8orf76`

Open jsstevenson opened this issue 4 months ago • 0 comments

Describe the bug The hgnc table contains entries for both ZHX1-C8ORF76 and ZHX1-C8orf76. The view tx_exon_aln_v contains alignments for each.

To Reproduce SELECT * FROM tx_exon_aln_v WHERE hgnc = 'ZHX1-C8ORF76';

SELECT * FROM tx_exon_aln_v WHERE hgnc = 'ZHX1-C8orf76';

Expected behavior The all-upper-case value is erroneous/an artifact of earlier data/something to that effect -- in HGNC symbols, I believe the open read frame "orf" is always supposed to be lower case, and they don't have an entry for the all upper case version: https://www.genenames.org/tools/search/#!/?query=ZHX1-C8ORF76

Additional context I'm guessing this is just an old value getting stuck somewhere in a table that UTA pulls from, but wanted to put this up in case it's still an issue in the in-progress latest data release.

jsstevenson avatar Oct 08 '24 15:10 jsstevenson