sgkit icon indicating copy to clipboard operation
sgkit copied to clipboard

Document best practice of windowing after filtering varaints

Open timothymillar opened this issue 3 years ago • 0 comments

sgkit stores genomic windows as the indices of their first and last variants within each window. This means that filtering variants (i.e. dropping some variants from the dataset) will invalidate the indices in window_start and window_stop. This should be documented and users encouraged to generate genomic windows after filtering any variants.

Related discussion: https://github.com/pystatgen/sgkit/discussions/786#discussioncomment-1936669

timothymillar avatar Jan 09 '22 21:01 timothymillar