Bismark icon indicating copy to clipboard operation
Bismark copied to clipboard

Bismark mapping issue

Open ccmeth opened this issue 2 years ago • 1 comments

Hi Developers,

Thanks a lot for providing such a useful tool. When I used the bismark code, I met with a very strange issue that did not appear before. I have three WGBS samples with paired-end sequencing. The size of each R1.fastq.gz is about 30G. I ssh the server three times to process the three samples simultaneously. However, the processes all ended abnormally after about ~60 million sequences were processed. Then I tried to check the generated bam files, however, when I entered the directory harboring the fastq and bam files, "ls" does not give any response and "ctrl+c" does not kill it. Could you please tell me how to fix such an issue?

Bismark version: 0.23.1 bismark --genome /data/REF/hg19 -1 xxx_R1.fastq.gz -2 xxx_R2.fastq.gz

ccmeth avatar Nov 21 '22 05:11 ccmeth

Hi @ccmeth

I am afraid I would need some more details regarding your procedure, especially the error message you are seeing. What exactly are you doing, which type of libraries do you have, what are the system resources on the machine you ssh into, did you perform trimming of the files beforehand etc. It might also be worth starting with a smaller sample to see if that completes well. To select say the first 1 million reads, you can do:

gunzip -c xxx_R1.fastq.gz | head -4000000 | gzip -c - >xxx_1M_R1.fastq.gz
gunzip -c xxx_R2.fastq.gz | head -4000000 | gzip -c - >xxx_1M_R2.fastq.gz

(again, ideally you should trim the reads first, e.g. with Trim Galore).

FelixKrueger avatar Nov 21 '22 19:11 FelixKrueger