tskit issues

Python 3.12 support

1

Python 3.12 has been released. The good news is that tskit builds and almost all tests pass. The failing ones are those that depend on `numba` via `lshmm` as `numba`...

benjeffery

Infrastructure and tools

Remove legacy h5py based formats

5

Opening this up for discussion. As mentioned by @jeromekelleher at https://github.com/tskit-dev/tskit/pull/2811#issuecomment-1663778875 it has been a long time since these legacy formats were used. When dropping them we should add a...

benjeffery

Remove tsk_diff_iter_t

4

After #2786 we don't actually use tsk_diff_iter_t at all in the library. As it's not part of the public API (and it causes some annoying problems internally, e.g. [here](https://github.com/tskit-dev/tskit/blob/f7ba5489ae9fa7bede54ad856f181c85f8759f6e/c/tskit/trees.c#L447)) I...

jeromekelleher

C API

Update stats API algorithms to use tree_position_t

1

We should be able to substantially simplify the stats API algorithms by using the tsk_tree_position_t class. This should be done before we generalise to windows that are not [0, L)...

jeromekelleher

Add parallelism to stats API

Once #2782 is implemented we can easily support threading along the genome by following the approach for divergence matrix in #2736.

jeromekelleher

enhancement

Use divergence_matrix for downstream statistics

3

I think we can rephrase at least ``genetic_relatedness`` (aka eGRM) in terms of ``divergence_matrix``, which should substantially improve performance (although waiting for #2779 which is needed for decent site-mode performance)....

jeromekelleher

Support windows in stats API that do not span whole genome

1

Currently the stats API requires that we cover the entire genome with windows, which is restrictive. In particular, it prevents us from parallelising along the genome in a simple way....

jeromekelleher

enhancement

Document divergence_matrix

6

Following up on #2736, we need to document the function. Note that I left the old partially implemented version of divergence matrix as a commented out block here as it...

jeromekelleher

documentation

Add support for ``individuals`` to divergence_matrix

Currently the divergence matrix supports a list of ``samples``. It would also be useful to support ``individuals`` as a mutually exclusive option. Initially we can implement this by post-processing the...

jeromekelleher

enhancement

Add docs discussing how to get a matrix of pairwise comparisons

3

this is a common thing to want to do

hyanwong

tskit
tskit copied to clipboard

Metadata

Python 3.12 support

Remove legacy h5py based formats

Remove tsk_diff_iter_t

Update stats API algorithms to use tree_position_t

Add parallelism to stats API

Use divergence_matrix for downstream statistics

Support windows in stats API that do not span whole genome

Document divergence_matrix

Add support for ``individuals`` to divergence_matrix

Add docs discussing how to get a matrix of pairwise comparisons

← Metadata

Owner

Metadata

tskit tskit copied to clipboard

Metadata

← Metadata

Owner

Metadata

tskit
tskit copied to clipboard