UNCALLED
UNCALLED copied to clipboard
Building reference gene panel, advise
Hi there,
I tried to use uncalled to improve the efficiency of cas9 targeted sequencing on low inputs and it seems the reference I generated rejected the reads after ~1KB instead of continuing to sequence the rest of the read.
uncalled realtime mouse_cas9panel.fa --port 8000 -t 8 --enrich -c 3 > uncalled_realtime.paf
I have not had the same issue with enriching in small genomes, like virus for example in a mixture of host DNA, or with a reference panel based on hg38. Have you seen this before or know of problems with generating references from mm10?
I am designing a panel of genes for a large signature in mouse (>150 genes) and I want to make sure that this doesn't happen. Do you have any advise/ tips when making a reference fasta for a large panel of genes?
Ubuntu 20 Uncalled version 2.1 MinKnow GUI 4.1.22
Opps, I just discovered your suggestions for masking: https://github.com/skovaka/UNCALLED/tree/master/masking I will try that.
Sorry you're having trouble! Reference masking is a good idea. Just to make sure, does your reference extend all the way to the ends of your targets? I'd also recommend running UNCALLED on those "regular cas9" reads to see if they can map to you reference in standalone mapping mode (see "Fast5 Mapping" in the readme). I haven't worked with the mouse genome before, but if you're still having trouble after masking you could send me your reference and I could take a look.
I am not sure if masking helped. The regular cas9 reads definitely map to the reference. But we don't see an improvement in yield compared to whole genome sequencing. What is the best way to send you the reference so you can check it out? Thank you!
You can email your reference to skovaka1 grep "model name" /proc/cpuinfo
should return the model name.