MutSig2CV
MutSig2CV copied to clipboard
Computing background mutation rates from MutSig2CV output on TCGA data
Hi,
I want to use the background mutation rates (BMRs) from MutSig2CV and I was wondering how to obtain those. I am currently looking at the output of MutSig2CV on TCGA data, e.g. at this link , and I have a few questions.
-
How do I obtain the BMRs \mu_{g,c,p} (e.g. as in the supplement of this Nature 2013 paper) from the MutSig2CV output? I was not sure which file has those. (I guess I would want the variables x_{g,c,p} and X_{g,c,p}, and could compute \mu_{g,c,p} = x_{g,c,p} / X_{g,c,p} ?)
-
In the Methods of the Lawrence et al 2014 paper, which is cited in the Github, in the "power analysis" section there is a quantity f_g, the gene-specific mutation rate factor. How do we get f)g from the output of MutSig2CV?
-
Is there any way to compute the number of "covered bases" per gene? Looking at the MutSig2CV output on the TCGA COADREAD cohort, each sample has a total number
N_ind
of covered coding bases. However the sum of the coding lengthscodelen
of all genes is larger thanN_ind
. So where in the results can I get the number of coding "covered bases" per gene?
Thank you very much!