minced
minced copied to clipboard
Output with too long spacers
Hey,
By default the spacers should not be longer than 50bp, but in some cases arrays are identified even though some spacers are much longer than 50bp. See example below:
It comes from the following contig: crispr_test2.txt
I use version 0.3.1
Cheers, Jakob
CRISPR 8 Range: 73 - 690
POSITION REPEAT SPACER -------- ----------------------- -------------------------------------------------------------- 73 TTGTAGATTTGAACTTGTAGATT GAACTTGCAGATTTGAACTTGTAAATTTGAACTTGTAGATTGAACTTGTATATTGAACTTGTAGATTTTGTAGATTTGAAC [ 23, 81 ] 177 TTGTTGATTTGAACTTGTAGATT GAACTTGTAGATTGAACTTGTAGAATGAACTTGTAGATTGAACTTGTAGATTGAACTTGTAGATTTTGTAGATTTGAAC [ 23, 79 ] 279 TTGTAGATTTGAACTTGTAGATT GAACTTGTAGATTGAACTTGTAGATTTTGTAGATTTGAATTTGTAGATTTGAAGCTGTAGATTTGAACTTGTAGATTTGAAC [ 23, 82 ] 384 TTGTTGATTTGAACTTGTTGAAT TGAACTTGTAGATTGAACCTGTAGATTTTGTAGATTTGAACTTGTAGATTTTGTAGATTGAAC [ 23, 63 ] 470 TTGTAGATTTGAACTTGTAGATT TTCTACTGTAGATTTACTTCTGTAGATTTCAATGTGAAGATT [ 23, 42 ] 535 TTGTAGATTTGAACTTGTAGATT TGAAATTGAAGATTTGAACTTGTAGATTGAACTTGCAGATTGAAC [ 23, 45 ] 603 TTGTAGATTTGAACTTGTAGATT TTGTAGATTTGAACTTGTACATTTGAACTTGAAGAATTGAAC [ 23, 42 ] 668 TTGTCAATTTGAACTTGTAGATT -------- ----------------------- -------------------------------------------------------------- Repeats: 8 Average Length: 23 Average Length: 62
Looking at how this comes through the code, I think it'll be a little tricky to fix. Will have to think more on it, but don't have the time right now.
Ok, I guess post-processing would be the simplest. Also, 50bp would probably be a bit too short if it was a hard limit on the spacer lengths.