Create human-readable taxonomy lookup table from precomputed database

Open cvigilv opened this issue 10 months ago • 4 comments

I'm currently using foldseek to prepare some datasets, and I would like to check whether the taxonomic information in the Alphafold/Proteome database matches what I obtained from the AlphaFold FTP server.

Is there any way to convert the binary _taxonomy file into a tab-separated values (TSV) file?
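
Once both sources are available as tab-separated tables, the comparison itself is straightforward. The following is only a minimal sketch under stated assumptions: it assumes each table has two columns (an entry identifier and an NCBI taxon ID), which is a hypothetical layout and not a documented foldseek export format. The function names `read_taxonomy_tsv` and `compare_taxonomy` and the sample identifiers are made up for illustration.

```python
import csv
import io


def read_taxonomy_tsv(handle):
    """Read a two-column TSV (identifier, taxon ID) into a dict.

    The two-column layout is an assumption made for this sketch.
    """
    table = {}
    for row in csv.reader(handle, delimiter="\t"):
        if len(row) < 2 or row[0].startswith("#"):
            continue  # skip blank, short, or comment lines
        table[row[0]] = row[1].strip()
    return table


def compare_taxonomy(left, right):
    """Return mismatched taxon IDs and identifiers present on one side only."""
    shared = left.keys() & right.keys()
    mismatched = {k: (left[k], right[k]) for k in shared if left[k] != right[k]}
    only_left = sorted(left.keys() - right.keys())
    only_right = sorted(right.keys() - left.keys())
    return mismatched, only_left, only_right


if __name__ == "__main__":
    # Hypothetical in-memory tables standing in for the two real files.
    from_database = io.StringIO("entry_A\t9606\nentry_B\t10090\n")
    from_ftp = io.StringIO("entry_A\t9606\nentry_B\t10116\n")
    result = compare_taxonomy(
        read_taxonomy_tsv(from_database), read_taxonomy_tsv(from_ftp)
    )
    print(result)
```

With real data, the two `io.StringIO` objects would be replaced by file handles opened on the exported lookup table and on the table downloaded from the FTP server.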

cvigilv, Apr 23 '24 15:04