pancancer
pancancer copied to clipboard
Updating Repo
The embargo on the data has lifted (see #75) and the paper has been published (See release v2.2
).
Therefore, it is time to update the repo. I will use this time to update the following:
- Adding all of the results of each analysis.
- Refactor code (focusing on
scripts/pancancer_classifier.py
) - Updating gene identifiers in gene expression data.
- This will require an updating to entrez gene ids to maintain consistency in other applications
There are also some research questions that will need to eventually be addressed.
- Transforming data to match distribution of training - removing the requirement for normalizing population of samples and permitting classification of a single sample.
I will use this issue to track thoughts, but this project to track progress