pileup: Count reads supporting each base at each position
I have a feature request. I'd like to see a pileup command that reports the number of reads that support each base at each position. I'm not interested in indels.
Thanks, Shaun
Hi Derek, @pezmaster31
The pileup-like output format I'd suggest would be eight columns in a tabular format, and quite a bit more concise than the pileup format when the additional information that pileup provides isn't necessary:
chr pos ref A C G T N
Where ref is the reference base, and A, C, G, T and N are the number of reads supporting each base at that position. Thoughts?
Cheers, Shaun
@sjackman, would this tool by @arq5x fit your needs? https://github.com/arq5x/piledriver
Thanks @jts, I'd forgotten about @arq5x's tool.
Will that work for you, @sjackman? It's more detailed than what you're looking for, but avoids re-inventing the wheel. At a quick glance, I don't see a 'num_N' field. Might be able to infer that from the data, not sure. If not, I know Aaron is open to feedback & suggestions.
Yeah, num_N is not implemented, but could be very easily. Unfortunately, I am on my way up to Canada for vacation, but can hack it when I get back.
Yes, piledriver looks very useful. It's also 30% faster than genome/bam-readcount (the other tool that I found for this job). Thanks, all. Derek, Aaron, would there be any interest in incorporating piledriver into bamtools? It seems to me that a simple pileup tool is conspicuously missing from bamtools.