dada2 40% of reads survived the pipeline

40% of reads survived the pipeline

Open dsamoht opened this issue 3 months ago • 0 comments

Hi, We have difficulties to pinpoint the reason why we loose ~60% of our sequences after the dada2 pipeline. We ran the pipeline multiple times with a range of parameters for maxEE and truncLen. Here are some results of our "best" run.

Reads tracker (maxEE=c(2,5), truncLen=c(200, 190), other parameters at default) : reads_tracker_2_5_200_190.csv

Error plot (FWD): error_plots_FWD_2_5_200_190

Error plot (REV): error_plots_REV_2_5_200_190

run characteristics:

Environmental water data ;
V4-V5 region of the 16S (515FB: 5’-GTGYCAGCMGCCGCGGTAA-3’, 926R: 5’-CCGYCAATTYMTTTRAGTTT-3’);
Primers have been removed with cutadapt (with --discard-untrimmed);
Miseq v2 (250PE);

FASTQC plots: FWD: FWD_fastqc REV: REV_fastqc

We would greatly appreciate your opinion on this problem, thank you!

Mar 20 '24 19:03 dsamoht

dada2 dada2 copied to clipboard

40% of reads survived the pipeline

dada2
dada2 copied to clipboard