dada2 icon indicating copy to clipboard operation
dada2 copied to clipboard

Reads loss during nonchim step.

Open hh1985 opened this issue 1 year ago • 1 comments

I applied dada2 to process my reads and received following QC results: image

Some samples had about one third of the reads removed in the nonchim step. I am not sure if this is reasonable.

Background: The data is paired-end data, and sequenced by BGI machine (V4, 515F (5’-GTGCCAGCMGCCGCGGTAA-3’) and 806R (5’- GGACTACHVGGGTWTCTAAT-3’)). The primers had been trimmed when I received the data. I further used trunclenf 180 and trunclenr 150 for the dada2 trimming: image

hh1985 avatar Dec 18 '23 02:12 hh1985

On the level of raw read fates, what you are seeing is not out of bounds, but it is a high level of chimera-flagged sequences. This is at the upper range of what we see from legitimate amplicon sequencing data, and is usually associated with higher than normal numbers of PCR cycles (30+) or non-standard mixtures of samples prior to PCR.

The primers had been trimmed when I received the data.

The other reason we see data like this is that the primers weren't really trimmed from the data, or at least not efficiently. If you are using an ASV method like DADA2, you are almost always best off obtaining the raw untrimmed data and working from that if you can.

benjjneb avatar Jan 04 '24 04:01 benjjneb