MMseqs2 icon indicating copy to clipboard operation
MMseqs2 copied to clipboard

convert mmseqs search results back to fasta

Open lzhangUT opened this issue 2 years ago • 0 comments

Expected Behavior

a fasta file with ID and sequences

Current Behavior

Steps to Reproduce (for bugs)

mmseqs databases UniProtKB db/uniprot tmp mmseqs createdb input_fasta queryDB mmseqs search queryDB db/uniprot resultDB tmp mmseqs convert2fasta resultsDB resultsDB.fasta

MMseqs Output (for bugs)

MMseqs Version: Use header DB false Verbosity 3

Database resultsDB needs header information

Context

it looks like everything is running fine, I got the resultDB.* files. Also I did mmseqs convertalis queryDB targetDB resultDB resultDB.m6 i can get the results in tab format. But my real goal here is to get the sequences for each fasta, not just the ids and scores, as my next step is to deduplicate the sequences and then do clustering.

Your Environment

Include as many relevant details about the environment you experienced the bug in. MMseqs Version: 7ebd2e0441e5c3bdec585317c2b1c3cdbf943568

any help would be appreciated. @milot-mirdita

lzhangUT avatar Sep 03 '22 02:09 lzhangUT