
Simulating SV in PacBio reads

Open priyambial123 opened this issue 1 year ago • 6 comments

Hello

I need some suggestions on simulating complex structural variants in a HiFi reads file from PacBio: https://downloads.pacbcloud.com/public/dataset/HG002-CpG-methylation-202202/m64011_190830_220126.hifi_reads.bam I understand from the wiki page that SVs can be created in the reference genome. How can I simulate these variants in the file that I am interested in?

Thank you

priyambial123 avatar Apr 11 '23 16:04 priyambial123

Hi, so just to get this right: you want to simulate SVs and use real reads? This is also handled by the reference-based simulation. There is an option that selects which way you want to simulate. In this case you would need to map the real reads to the modified reference.

Hope that helps, Fritz

fritzsedlazeck avatar Apr 11 '23 16:04 fritzsedlazeck

I want to create structural variants in the downloaded PacBio data, run it through an SV detection pipeline, and see whether the variants are detected. Is this possible?

Thank you

priyambial123 avatar Apr 11 '23 16:04 priyambial123

Yes, see https://github.com/fritzsedlazeck/SURVIVOR/wiki#quick-start and change the options as described in the text.

fritzsedlazeck avatar Apr 11 '23 16:04 fritzsedlazeck

Thank you. I have to replace reference.fasta with the downloaded FASTA file. Is this right?

priyambial123 avatar Apr 11 '23 16:04 priyambial123

Please read the instructions. You need to change the one option from 0 to 1, and then remap your reads to the newly generated FASTA file.

fritzsedlazeck avatar Apr 11 '23 17:04 fritzsedlazeck
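
The two steps described above (run simSV with the option switched from 0 to 1, then remap the real reads to the generated FASTA) can be sketched as command lines. This is a minimal sketch under assumptions, not the maintainer's recipe: the argument order for `simSV` follows the command quoted elsewhere in this thread; the file names `reference.fasta`, `parameter_file`, and `hifi_reads.fastq` are placeholders; the reads are assumed to have been converted from BAM to FASTQ beforehand; and minimap2 with its `map-hifi` preset is only one possible choice of mapper, which the thread does not specify. The helper functions are hypothetical and only build the commands without running them.

```python
import shlex


def simsv_command(reference, parameter_file, snp_rate=0.1, read_mode=1,
                  prefix="simulated", survivor="./SURVIVOR"):
    """Build a SURVIVOR simSV command line (hypothetical helper).

    read_mode is the option discussed in the thread: 0 when reads will be
    simulated, 1 when real reads will be remapped to the modified reference.
    """
    return [survivor, "simSV", reference, parameter_file,
            str(snp_rate), str(read_mode), prefix]


def remap_command(modified_reference, reads_fastq, mapper="minimap2"):
    """Build a remapping command (assumption: minimap2 with the HiFi preset)."""
    return [mapper, "-ax", "map-hifi", modified_reference, reads_fastq]


# Placeholder file names; replace with your own paths.
sim_cmd = simsv_command("reference.fasta", "parameter_file")
map_cmd = remap_command("simulated.fasta", "hifi_reads.fastq")

# Print the commands for inspection instead of executing them.
print(shlex.join(sim_cmd))
print(shlex.join(map_cmd))
```

The resulting alignments would then go into the SV detection pipeline in place of alignments against the unmodified reference.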

Thank you. So, I did these steps:

Simulated the structural variations in the reference genome using the parameters given in the package:

./SURVIVOR simSV "/SURVIVOR/Debug/human_GRCh38_no_alt_analysis_set.fasta" "/SV_tools/SURVIVOR/Debug/parameter_file" 0.1 0 simulated

Then simulated the reads using simlord:

simlord --read-reference /SV_tools/SURVIVOR/Debug/simulated.fasta -n 10000 myreads

I ran the reads through my SV detection workflow and there were no structural variants in the VCF file. Is this because of the low number of reads generated?

Thank you

priyambial123 avatar Apr 17 '23 11:04 priyambial123
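
A rough coverage estimate helps judge whether 10,000 reads could plausibly be the problem. The sketch below uses the standard approximation coverage = number of reads × mean read length / genome size. The genome size of about 3.1 Gb for GRCh38 and the mean read length of 8 kb are illustrative assumptions, not values taken from this thread or from simlord's settings, so substitute the numbers from your own run.

```python
import math


def expected_coverage(n_reads, mean_read_length, genome_size):
    """Average depth: total sequenced bases divided by genome size."""
    return n_reads * mean_read_length / genome_size


def reads_needed(target_coverage, mean_read_length, genome_size):
    """Number of reads required to reach a target average depth."""
    return math.ceil(target_coverage * genome_size / mean_read_length)


GENOME_SIZE = 3.1e9       # assumption: approximate size of the human genome
MEAN_READ_LENGTH = 8_000  # assumption: illustrative mean read length in bp

cov = expected_coverage(10_000, MEAN_READ_LENGTH, GENOME_SIZE)
print(f"10,000 reads give roughly {cov:.3f}x coverage")
print(f"reads needed for 20x: {reads_needed(20, MEAN_READ_LENGTH, GENOME_SIZE):,}")
```

Under these assumptions 10,000 reads amount to well under 1x average depth, so most simulated SVs would not be spanned by even a single read. That makes low read count one plausible explanation for the empty VCF, though it does not rule out other causes.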