tskit icon indicating copy to clipboard operation
tskit copied to clipboard

Population-scale genomics

Results 291 tskit issues
Sort by recently updated
recently updated
newest added

Following up on #1596, we'd like to build out some basic functionality to help us work with cross coalescence statistics. The simplest thing we can do to start with (as...

Python API
statistics

## Description Fixes #1294. Should probably add some tests for attributes of the graph like `branch_length`, `left`, and `right`. # PR Checklist: - [x] Tests that fully cover new/changed functionality....

As shown in https://github.com/tskit-dev/msprime/discussions/1901 metadata validation is proving to be quite expensive for msprime simulations. One option would be to change our code to use https://github.com/horejsek/python-fastjsonschema It has [conda-forge](https://anaconda.org/conda-forge/python-fastjsonschema) and...

Python API
Performance

This script: ``` import tskit t = tskit.TableCollection(sequence_length=1) t.nodes.add_row(flags=1, time=0) t.nodes.add_row(flags=1, time=0) ts = t.tree_sequence() tr = ts.first() print("MRCA", tr.mrca(0, 1)) print("TMRCA", tr.tmrca(0, 1)) ``` produces ``` MRCA -1 Traceback...

enhancement
Python API

Reasoning: #2047. See also #464. Presumably we also want a `get_individuals_associated with_sample_nodes()` function.

enhancement
Python API

After a brief chat with @petrelharp we thought it might be time to revisit the general structure of https://tskit.dev/tskit/docs/stable/stats.html, because (for instance) it doesn't mention the LDcalc stuff, or have...

documentation
statistics

It would be good to add an example of using at least some of the statistics on the Statistics page. It's a bit of a wall of text at the...

documentation

An error that I have seen quite a few people make already is that they pass in a list of individual ids to `samples=`, instead of their constituent nodes. It's...

enhancement
Python API

The possibility of missing data is being added in #272; the stats need to be modified to take this into account. At least two issues: 1. Site stats: isolated samples...

enhancement

Following up on #1937, it would be generally useful to make the node based caching in the general stat framework optional in the C code. Then propagate the option back...

C API