Strange behavior in the gene validator
Try PAQR10:


It was related to this: https://github.com/cBioPortal/cbioportal/issues/6835
There are many genes with duplicate symbols: https://docs.google.com/spreadsheets/d/1faa6pufHkwlFFNhSOIE4X-c8Z-rrB2GJof7L_QviXOs/edit#gid=0. As a temporary fix, we could manually replace some of them while @rmadupuri et al. continue the effort to switch to HGNC.
Yichao will review the protein coding genes that have multiple entrez gene IDs and fix them in the database: https://docs.google.com/spreadsheets/d/1faa6pufHkwlFFNhSOIE4X-c8Z-rrB2GJof7L_QviXOs/edit#gid=1770544155
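For anyone who wants to reproduce the list of affected genes, here is a minimal sketch of the check, assuming the gene table can be exported as `(entrez_gene_id, hugo_symbol)` pairs. The function name, the row layout, and the sample IDs are hypothetical and are not taken from the cBioPortal code or database.

```python
from collections import defaultdict


def find_duplicate_symbols(genes):
    """Return {symbol: sorted Entrez IDs} for symbols mapped to more than one ID.

    `genes` is an iterable of (entrez_gene_id, symbol) pairs. This layout is an
    assumption for illustration, not the actual cBioPortal schema.
    """
    ids_by_symbol = defaultdict(set)
    for entrez_id, symbol in genes:
        # Compare symbols case-insensitively so "paqr10" and "PAQR10" collide.
        ids_by_symbol[symbol.upper()].add(entrez_id)
    return {s: sorted(ids) for s, ids in ids_by_symbol.items() if len(ids) > 1}


# Hypothetical rows; the IDs and symbols below are made up.
rows = [(1001, "GENE_A"), (1002, "GENE_A"), (2001, "GENE_B")]
duplicates = find_duplicate_symbols(rows)
print(duplicates)  # {'GENE_A': [1001, 1002]}
```

Each symbol in the result would still need manual review to decide which Entrez ID to keep.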
@jjgao @inodb The results are in the spreadsheet attached here. Some Entrez IDs can clearly be removed, but many are still ambiguous. I listed the specific TODOs and questions for each gene in the sheet: db_duplicate_protein_coding_genes.xlsx
@yichaoS I forgot what we decided. Are we going to manually remove the redundant ones, or are we going to fix this as part of the gene data refresh effort?
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.