SISRS
SISRS copied to clipboard
Subsample via normalization?
Future thought: Instead of subsampling reads randomly, perhaps we could be assembling better genomes and getting more representative mapping if we instead:
- Get raw reads for X species
- Trim etc.
- For 10X Mapping Data: Cat and normalize reads from each species by kmer to ~10X coverage (bbamp can do this)
- For Genome Data: Subsample normalized reads to create genome data.
Check out other aligners, specifically with functionality at degenerate bases.