clipseq icon indicating copy to clipboard operation
clipseq copied to clipboard

Reads with / characters in the read names cause the pipeline to fail

Open marc-jones opened this issue 3 years ago • 1 comments

Check Documentation

I have checked the following places for your error:

Description of the bug

If / characters are in the read names, then the pipeline will fail at the dedup step. This is caused, however, by STAR truncating read names based on the / character. This removes the UMI information which was added earlier in the pipeline, causing the dedup step to fail.

Steps to reproduce

Steps to reproduce the behaviour:

Running the pipeline with reads with a / in the read name. Example

Expected behaviour

Either a verbose error message saying that the reads are incompatible, or automatic removal or replacement of the / characters.

marc-jones avatar Oct 01 '21 15:10 marc-jones