cycledash icon indicating copy to clipboard operation
cycledash copied to clipboard

RNASeq expression worker

Open ihodes opened this issue 9 years ago • 9 comments

@iskandr:

PGV relies on allele-specific quantification of expression from RNAseq, how can that data be shown in Cycledash?

from https://github.com/hammerlab/cycledash/issues/852

ihodes avatar Oct 08 '15 21:10 ihodes

Some near-term ways to address this might be:

  • allow an RNA BAM to be specified on run submission
  • allow a third BAM (the RNA BAM) to be displayed in the pileups

arahuja avatar Oct 09 '15 15:10 arahuja

cf. https://github.com/hammerlab/cycledash/issues/790 for multiple BAMs in pileup

allow an RNA BAM to be specified on run submission

Right now you can submit as many BAMs as you'd like; would you want a tag on a BAM designating it as RNA data?

ihodes avatar Oct 09 '15 15:10 ihodes

Yes, you can submit any number of BAMs, but you can't submit a run with multiple tumor BAMs:

image

But assuming the "multiple BAMs in a pileup" feature allows you choose any other BAM from the project, I guess it doesn't matter too much. But, I could also see you wanting to tie a run to a specific RNA BAM to get the allele specific stats. Tagging a BAM as RNA also makes sense.

arahuja avatar Oct 09 '15 16:10 arahuja

That makes sense; thanks for the explanation!

ihodes avatar Oct 09 '15 16:10 ihodes

@iskandr: what is the summary statistics that you would like to show if we have the RNA-bam feature implemented? Some measure of within-sample normalized expression for a variant? Or simply number of reads supporting that particular variant?

Let me know if this needs an off-line discussion for a through discussion.

armish avatar Nov 03 '15 20:11 armish

I think that (1) the number of reads that contain a variant (2) the number of reads overlapping the locus and (3) the normalized expression value for that variant would all be good to see. What do you think?

iskandr avatar Nov 03 '15 21:11 iskandr

  1. Definitely a go-go;
  2. We have to define what a locus is for our purposes. Do you mean counting all reads falling into the gene locus for the corresponding transcript?
  3. Would be great to have that, but I first have to read on normalizing expression within a single sample. Did your manual PGV pipeline have this already?

armish avatar Nov 03 '15 21:11 armish

@ihodes: OK for me to start working on this? I am planning to add an RNASeq BAM file field to runs as a first step just to quickly get it done. I think we can ideally have a more flexible BAM submission for users where they can attach any number of BAMs with a controlled label (e.g. Normal, Tumor, RNASeq, etc.) but that can be our next goal. What do you think?

armish avatar Nov 06 '15 19:11 armish

@armish I'd defer to @iskandr as to what he needs—I had the impression we're waiting on a new tool he's developing next week, and that existing tools wouldn't get us what we need.

ihodes avatar Nov 06 '15 20:11 ihodes