foldseek
foldseek easy-cluster iteratively with different batches at different times
Hi,
Is it possible to run foldseek easy-cluster at different points in time on different batches without reprocessing everything? For example, I have 10,000 PDB files that I clustered today. In three weeks' time, I add another 10,000 structures to the folder of PDB files. When I run foldseek easy-cluster again, is there a way to tell it to reuse the results from the first 10,000 files to minimise compute?
Hi, I am also interested in this possibility. Thanks in advance.