moscot icon indicating copy to clipboard operation
moscot copied to clipboard

symbol column key argument when using gene sets

Open ivirshup opened this issue 1 year ago • 1 comments

This is a feature request for methods like score_genes_for_marginals.

It's generally bad practice to use gene symbols as the var_names for an AnnData object, since they're not unique and can easily become ambiguous. Unique identifiers, like ensembl IDs, are better as primary keys for genes.

It would be great if methods like score_genes_for_marginals allowed using ensembl ids as the primary keys, and let users specify the which column in var provides gene symbols with a key word argument.

ivirshup avatar Sep 26 '23 07:09 ivirshup

Hi @ivirshup,

Thanks for raising this!

Which database would you recommend for gene ids?

MUCDK avatar Oct 04 '23 14:10 MUCDK