MMseqs2 icon indicating copy to clipboard operation
MMseqs2 copied to clipboard

linclust on fasta file or DB file?

Open MarliesJFrancine opened this issue 1 year ago • 1 comments

Hi,

I want to cluster a large dataset of DNA sequences. Must I first convert my fasta file into a DB format file? As is written here: https://github.com/soedinglab/MMseqs2/wiki#linclust, or can I use my fasta file directly? As is written here on the GitHub page.

What is the best approach here?

Kind regards, Marlies

MarliesJFrancine avatar Sep 24 '24 10:09 MarliesJFrancine

You can use mmseqs easy-linclust with FASTA file, or convert your FASTA file into a DB file by mmseqs createdb and then cluster it by mmseqs linclust .

Prangejet avatar Oct 30 '24 04:10 Prangejet