VariantAnnotation icon indicating copy to clipboard operation
VariantAnnotation copied to clipboard

Create protein sequences including variants from a VCF file

Open baderd opened this issue 5 years ago • 3 comments

Dear BiocTeam,

I am investigating the proteome of human cancer samples and want to insert their genetic variations into the reference proteome fasta sequences to increase the sensitivity of my peptide/protein quantification.

Can you implement this "proteomeVariantInsertion()" in the VariantAnnotation package?

The VariantAnnotation::predictCoding() function already translates codons at variant positions from a reference BSgenome object to assess the consequences of a variant. I would like to take all coding variants (or just non-synonymous SNVs for a start) and insert them into the reference proteome, then save the modified fasta file.

See also my post in the Bioconductor forum.

Thanks, Daniel

baderd avatar Mar 08 '19 12:03 baderd

Do you want to modify the DNA sequences or the protein sequences? I think BSgenome::injectSNPs() should get you pretty close to the former, and it wouldn't be too hard to adapt to the latter. Either way, I'm not sure this belongs in VariantAnnotation.

lawremi avatar Mar 08 '19 13:03 lawremi

I agree that my request sounds odd for VariantAnnotation, but is rather fitted for manipulation of Biostrings or similar. The reason I linked my original question to this package is that predictCoding() does already everything I want except translating only single positions and not the protein isoform as a whole.

Meanwhile, I suggested a solution workflow in the Bioc forum.

baderd avatar Mar 12 '19 07:03 baderd

"proteomeVariantInsertion()" would be very useful.

karlmakepeace avatar Sep 27 '21 22:09 karlmakepeace