ray icon indicating copy to clipboard operation
ray copied to clipboard

add scaffolder metrics

Open sebhtml opened this issue 12 years ago • 0 comments

That's a good point.

A good metric that Ray could produce to start with would be the number of pairs (including mates) with:

  1. both ends within a contig;
  2. one end on one contig and the other end on another contig
  3. one end on one contig and the other not mapped
  4. both ends not mapped

You suggest that a sizable part of the pairs (including mates) arein 3. and 4. when using a k-mer length of 61-91. That's likely.

I think it is probably the case as mate pairs usually include also an adapter too, and that consume previous space in the sequences.

For the time being, I believe that "use another scaffolder" is your best bet.

Speaking of scaffolders, I will soon (hopefully) fix the speed issue for scaffolding of large genomes due to repeated k-mers [1].

sebhtml avatar Dec 13 '12 22:12 sebhtml