
Support windows in stats API that do not span whole genome

Open jeromekelleher opened this issue 2 years ago • 1 comments

Currently the stats API requires that the windows cover the entire genome, which is restrictive. In particular, it prevents us from parallelising along the genome in a simple way.
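To make the restriction concrete, here is a minimal sketch of the padding workaround it forces on callers. It assumes the current rule is that window breakpoints must start at 0 and end at the sequence length; `pad_windows` is a hypothetical helper, not part of tskit.

```python
def pad_windows(windows, sequence_length):
    """Extend partial breakpoints so that they span [0, sequence_length].

    Hypothetical helper (not a tskit function). Returns (padded, keep),
    where `keep` is a slice selecting the rows of a windowed result that
    correspond to the windows originally asked for.
    """
    windows = list(windows)
    if len(windows) < 2:
        raise ValueError("need at least two breakpoints")
    if any(a >= b for a, b in zip(windows, windows[1:])):
        raise ValueError("breakpoints must be strictly increasing")
    if windows[0] < 0 or windows[-1] > sequence_length:
        raise ValueError("breakpoints must lie within [0, sequence_length]")
    # Add a flanking window on each side only where one is needed.
    lead = [0.0] if windows[0] > 0 else []
    tail = [float(sequence_length)] if windows[-1] < sequence_length else []
    padded = lead + windows + tail
    keep = slice(len(lead), len(lead) + len(windows) - 1)
    return padded, keep


padded, keep = pad_windows([20, 30, 50], sequence_length=100)
# Assumed usage, with `ts` a tskit.TreeSequence:
#     ts.diversity(windows=padded)[keep]
```

Note that this only hides the restriction: the underlying computation still sweeps the whole genome, so it does not help with parallelising along it.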

The main roadblock for this is the low-level implementation of the stats, which uses incremental algorithms that don't easily support starting and stopping partway along the genome.

This would be an excellent application for the approach discussed in #2778.

jeromekelleher avatar Jul 07 '23 11:07 jeromekelleher

Note this will have to take into account the issue discussed in #2792, where we need to think about the consequences of edges not being inserted into the tree in time-sorted order when we seek to a given position. Almost certainly the right thing to do for the stats API will be to do a postorder traversal (but this requires that we keep the full quintuply linked tree).
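As a rough illustration of why a postorder traversal needs more than parent pointers, the sketch below walks a tree stored as plain Python lists and accumulates per-node sample counts. The array names mirror the `parent` / `left_child` / `right_sib` convention, but the functions and the toy tree are assumptions for illustration, not the low-level tskit implementation.

```python
NULL = -1


def postorder(root, left_child, right_sib):
    """Return the nodes below `root` with every child before its parent."""
    order = []
    stack = [root]
    while stack:
        u = stack.pop()
        order.append(u)
        # Walking the children needs the child and sibling links;
        # a parent array alone cannot enumerate them.
        v = left_child[u]
        while v != NULL:
            stack.append(v)
            v = right_sib[v]
    # Parents were recorded before their children, so reverse the list.
    order.reverse()
    return order


def subtree_sample_counts(root, parent, left_child, right_sib, is_sample):
    """Count samples below each node, independent of edge insertion order."""
    count = [1 if s else 0 for s in is_sample]
    for u in postorder(root, left_child, right_sib):
        if u != root and parent[u] != NULL:
            count[parent[u]] += count[u]
    return count


# Toy tree: node 3 is the parent of 0 and 1; node 4 is the parent of 3 and 2.
parent = [3, 3, 4, 4, NULL]
left_child = [NULL, NULL, NULL, 0, 3]
right_sib = [1, NULL, NULL, 2, NULL]
is_sample = [True, True, True, False, False]

nodes = postorder(4, left_child, right_sib)
counts = subtree_sample_counts(4, parent, left_child, right_sib, is_sample)
```

Because each node's total is complete before it is added to its parent, the result does not depend on the order in which the edges were inserted when seeking.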

jeromekelleher avatar Jul 13 '23 14:07 jeromekelleher

Closing for inactivity and labelling "future". Please re-open if you plan to work on this.

benjeffery avatar Jun 12 '25 22:06 benjeffery