mixcr icon indicating copy to clipboard operation
mixcr copied to clipboard

PCR and Sequencing error correction improvement

Open PoslavskySV opened this issue 7 years ago • 0 comments

  1. Match quality and probability for more rigorous filtering of sequencing errors: a. Use MaxQuality strategy for overlap merger b. Use SumQuality for clone accumulators c. Filter out clones with big error probability (p = (1 - Prod{1 - 10^(-q_i/10)} ))
  2. Strategy for more rigorous PCR error correction: a. Introduce lower bound (lowerBound) for initial molecule count (approx. 10 * (total reads count) / (minimal clone count) ) b. Estimate error rate for each type of substitution (e.g. A -> T) by counting all variants in each cluster and averaging over all clusters with possible outlier detection c. Cluster minor clones that satisfies estimated error rates within e.g. 3sigma and (d.) use (total reads count) / lowerBound as absolute minimal concentration of minor clone for clustering
  3. Test all these things against real data

PoslavskySV avatar Jul 07 '16 12:07 PoslavskySV