vak
improve split algorithm with some sort of subset covering solver?
Based on this blog post, it seems to me that we could frame the problem of splitting a dataset into splits of specified durations, each containing the same set of labels, as a subset cover problem, and maybe use one of the existing solvers: https://jeremykun.com/2020/10/20/optimization-models-for-subset-cover/
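To make the framing concrete, here is a minimal sketch of one split treated as a covering problem: pick a subset of files whose labels together cover every required label and whose total duration reaches the target with as little overshoot as possible. The function name `find_split`, its parameters, and the toy data are all hypothetical and are not part of vak. The exhaustive search is exponential in the number of files, so it only stands in for a real solver (for example an integer-programming model like those in the blog post) and is meant to show the constraints, not to be used on a real dataset.

```python
from itertools import combinations


def find_split(durations, labelsets, target_dur, required_labels):
    """Hypothetical helper: return a tuple of file indices whose labels
    cover `required_labels` and whose total duration is the smallest
    value that is still >= `target_dur`. Return None if no subset works.

    Brute force over all subsets, so only suitable for tiny examples.
    """
    required = set(required_labels)
    best, best_total = None, float("inf")
    n = len(durations)
    for k in range(1, n + 1):
        for subset in combinations(range(n), k):
            total = sum(durations[i] for i in subset)
            # Skip subsets that are too short or no better than the best so far.
            if total < target_dur or total >= best_total:
                continue
            # Coverage constraint: every required label appears in some file.
            covered = set().union(*(labelsets[i] for i in subset))
            if required <= covered:
                best, best_total = subset, total
    return best


# Toy data (assumed): duration in seconds and set of labels for each file.
durations = [4.0, 3.0, 5.0, 2.0]
labelsets = [{"a", "b"}, {"b", "c"}, {"a", "c"}, {"c"}]

train = find_split(durations, labelsets, target_dur=6.0,
                   required_labels={"a", "b", "c"})
print(train)
```

In a solver-based version, each file would become a binary decision variable, label coverage would become one constraint per label, and the duration target would become either a constraint or part of the objective. Producing several disjoint splits at once would add one set of variables per split.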