Option to consider last allele as reference in BGEN reader

Open: henrydavidge opened this issue 4 years ago • 1 comment

Our BGEN reader always considers the first allele to be the reference. However, the BGEN spec does not codify the position of the ref allele, and it can vary from dataset to dataset.

We should accept a datasource option to configure this behavior. Relevant code: https://github.com/projectglow/glow/blob/master/core/src/main/scala/io/projectglow/bgen/BgenFileIterator.scala#L97

henrydavidge, Jan 27 '21 15:01
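
For illustration, here is a minimal sketch of the behavior such an option might select. It is plain Python rather than Glow's Scala reader code, and the function name `split_alleles` and the flag `ref_is_last` are hypothetical, not existing Glow identifiers or option names. It only shows the reference/alternate split; any per-genotype values that are ordered by allele would presumably need the same reordering.

```python
from typing import List, Tuple


def split_alleles(alleles: List[str], ref_is_last: bool = False) -> Tuple[str, List[str]]:
    """Split a variant's allele list into (reference, alternates).

    `ref_is_last` is a hypothetical flag standing in for the proposed
    datasource option: False keeps the current first-allele-is-reference
    behavior, True treats the last allele as the reference instead.
    """
    if not alleles:
        raise ValueError("variant has no alleles")
    if ref_is_last:
        # Last allele is the reference; everything before it is alternate.
        return alleles[-1], alleles[:-1]
    # Default: first allele is the reference; the rest are alternates.
    return alleles[0], alleles[1:]
```

With the default, `split_alleles(["A", "G"])` treats `A` as the reference; with `ref_is_last=True`, the same input treats `G` as the reference.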

Alternatively, it could be good to support aligning with a particular reference genome after the fact, which is more of an "ELT" approach.

ptn24, Jul 22 '21 00:07
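
A rough sketch of the after-the-fact alignment idea, again in plain Python and not tied to any Glow API. The function `align_to_reference` is hypothetical, and so are its assumptions: the reference genome is available as an in-memory dict from contig name to sequence, and `start` is a 0-based position. The allele whose bases match the reference sequence at that position is taken as the reference allele.

```python
from typing import Dict, List, Optional, Tuple


def align_to_reference(
    contig: str,
    start: int,
    alleles: List[str],
    reference: Dict[str, str],
) -> Optional[Tuple[str, List[str]]]:
    """Pick the reference allele by comparing alleles to a reference genome.

    Returns (reference_allele, alternates), or None when the contig is
    unknown, no allele matches, or more than one allele matches (ambiguous,
    which can happen for indels that share a prefix).
    """
    seq = reference.get(contig)
    if seq is None:
        return None
    # Indices of alleles that equal the reference bases at this position.
    matches = [
        i for i, allele in enumerate(alleles)
        if seq[start:start + len(allele)].upper() == allele.upper()
    ]
    if len(matches) != 1:
        return None
    i = matches[0]
    return alleles[i], alleles[:i] + alleles[i + 1:]
```

Returning `None` for ambiguous or unmatched sites leaves the decision to the caller, for example to fall back to a fixed first-or-last convention for those rows.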