hts-specs
hts-specs copied to clipboard
Clarification of empty vector representation
as triggered by https://github.com/samtools/bcftools/issues/1622 and https://github.com/samtools/hts-specs/issues/593
State explicitly that missing vectors (.
) in fields of variable length are in BCF represented as MISSING,END_OF_VECTOR
, i.e. the leading MISSING
value must be always present.
#593 makes a case that [END_OF_VECTOR, END_OF_VECTOR]
would also be a reasonable encoding of this situation.
What is the reason for requiring that “[e]mpty vectors (i.e. vectors with no data available)” be represented as [MISSING, END_OF_VECTOR]
(which is seemingly less empty!)?