hivtrace
hivtrace copied to clipboard
Cluster diameter related features
It might be desirable to
- Report the diameter of any cluster, i.e.
max (k, n in cluster) = distance (k, n)
. - Permit, optionally, to reduce/split clusters so that a maximum diameter specified a priori is enforced. The strategy for how to enforce diameter maxima is not entirely clear, because multiple options appear feasible.
A general implementation would either require all pairwise TN93 distances, as opposed to only those that are ≤ threshold) to be available (not desirable due to file sizes), or the ability to recompute distances on the fly, which would entail subsequent calls to TN93 to compute distances between all sequences in a cluster.