seqtk icon indicating copy to clipboard operation
seqtk copied to clipboard

added -B/-E to trimfq for keeping first/last INT bp and also -s for shortest read

Open ndaniel opened this issue 7 years ago • 1 comments

This is basically a resurrected issue https://github.com/lh3/seqtk/pull/38 which was brought up to date to the latest release of seqtk such that they do not interfere with the original command line options of seqtk anymore.

More precisely this adds for trimfq the following:

-s INT      trimming by -b/-e/-B/-E shall not produce reads shorter then INT bp
-B INT      keep first INT bp from left (non-zero to disable -q/-e/-E)
-E INT      keep last INT bp from right (non-zero to disable -q/-b/-B)

This allows a more precise control of how trimming is done. This kind of trimming is used heavily in FusionCatcher (by using a forked seqtk instead of the original seqtk). Here https://github.com/lh3/seqtk/pull/38 was mentioned that this kind of trimming is rare but actually in ractice is used a lot. Regarding popularity of such trimming, one has that, for example:

ndaniel avatar Mar 06 '17 08:03 ndaniel

Any news here @lh3 @ndaniel ?

yhoogstrate avatar Sep 14 '17 07:09 yhoogstrate