
Human gene symbols are converted to uppercase

Open leonfrench opened this issue 7 years ago • 3 comments

Hi,

Official human gene symbols that contain lowercase letters, like 'C1orf226', end up converted to all uppercase ('C1ORF226'). This is a minor issue, but hopefully it's easy to fix.

I looked in the repo for 'upper' but didn't see a clear place for a bug fix. I've seen this in two human datasets but I haven't looked at others.

Thank you for creating such a useful tool!

leonfrench, Jan 11 '19 23:01

Thank you for the feedback!

Did you encounter the issue in a downloaded gene expression file, or in one of the plots displayed in the notebooks? If so, which plot(s)? Feel free to provide a link to a notebook; I'd be happy to look into it.

denis-torre, Mar 04 '19 20:03

It was in the downloaded gene expression file too, so I think it happens in the ARCHS4 pipeline.

Any human dataset should work as an example: https://amp.pharm.mssm.edu/biojupies/notebook/UtXk3iNLL. The downloaded expression txt file has it: C1ORF226, or any other 'orf' gene.

Thanks,

leonfrench, Mar 04 '19 21:03

It does indeed happen in the ARCHS4 pipeline, thank you for pointing it out. We will fix this in the next ARCHS4 version (v7) - I'll update the issue as soon as it is released.

denis-torre, Mar 04 '19 21:03
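
Until a fixed ARCHS4 release is available, one possible user-side workaround is to map the uppercased symbols in a downloaded expression file back to their official spelling. The sketch below is only an illustration under assumptions: `build_case_map` and `restore_case` are hypothetical helper names, not part of BioJupies or ARCHS4, and the list of official symbols has to come from a reference you trust. Symbols that are not found in the reference are left unchanged.

```python
def build_case_map(official_symbols):
    """Map each uppercased symbol to its official spelling.

    Hypothetical helper: `official_symbols` is any iterable of official
    gene symbols supplied by the caller (an assumption, not an ARCHS4 API).
    """
    case_map = {}
    ambiguous = set()
    for symbol in official_symbols:
        key = symbol.upper()
        # Two different official symbols that collide once uppercased
        # cannot be restored safely, so remember the key and drop it later.
        if key in case_map and case_map[key] != symbol:
            ambiguous.add(key)
        case_map[key] = symbol
    for key in ambiguous:
        del case_map[key]
    return case_map


def restore_case(symbols, case_map):
    """Return the symbols with official casing where a mapping is known."""
    return [case_map.get(symbol, symbol) for symbol in symbols]


# Example with a tiny, hand-written reference list.
official = ["C1orf226", "TP53", "C9orf72"]
case_map = build_case_map(official)
restored = restore_case(["C1ORF226", "TP53", "UNKNOWN1"], case_map)
print(restored)
```

The lookup is keyed on the uppercased symbol, so it only undoes the case change and does not attempt any alias or synonym resolution.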