foldseek icon indicating copy to clipboard operation
foldseek copied to clipboard

foldseek easy-cluster iteratively with different batches at different times

Open josephhughes opened this issue 11 months ago • 1 comments

Hi,

Is it possible to do foldseek easy-cluster at different points in time with different batches without needing to reprocess everything. For example, I have 10,000 pdb files that I clustered today. Then in 3 weeks time, I add another 10,000 sequences to the folder of pdb files. When I run foldseek easy-cluster, is there a way for me to tell it that it can use the results of the first 10,000 files to minimise compute?

josephhughes avatar Mar 05 '24 16:03 josephhughes

Hi, I am also interested in this possibility. Thanks in advance.

CRC63 avatar Mar 27 '24 15:03 CRC63