MAPLE
MAPLE copied to clipboard
Extension for Maple format: including insertions
In order to allow lossless compression, insertions need to be included in the Maple format.
I could find this edge case specified in the preprint.
It's easy to do, one just needs to agree to a convention, e.g.
2134 ins ACGTT
for an insertion of ACGTT after (or before) nucleotide 2134.
Alternative: no need for magic word, one simply includes multiple letters instead of one (I think this would be akin to VCF). If 2134 is usually C, one would write:
2134 CACGTT
for an insertion of ACGTT after nucleotide 2134.
Would be good of you could include treatment of insertions in the preprint.
I think both proposals would work in principle. Both have advantages.
The first is a bit more explicit, the second doesn't require a magic word.