
find duplicate protein substrings with cd-hit

Open conchoecia opened this issue 1 year ago • 2 comments

use cd-hit to find duplicate protein substrings in the input protein files

use this for "best" filtering option

needs these files: cd-hit, cdhit.c++, cdhit-common.h, cdhit-common.o, cdhit.o, cdhit-utility.o, Makefile, license.txt
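
a rough sketch of how this could work, not how odp does it today: run cd-hit at 100% identity so that exact duplicates and proteins fully contained in a longer protein land in the same cluster, then read the .clstr file to see which sequences were absorbed. The function names (`cdhit_command`, `run_cdhit`, `parse_clstr`) are hypothetical, and the flag choices (`-c 1.0`, `-d 0`) are assumptions about what the "best" filter would want.

```python
import re
import subprocess

# Matches one member line of a cd-hit .clstr file, for example:
#   "1\t120aa, >protB... at 100.00%"   (redundant member)
#   "0\t300aa, >protA... *"            (cluster representative)
MEMBER_RE = re.compile(r"^\d+\s+(\d+)aa,\s+>(.+?)\.\.\.\s+(\*|at\s.+)$")


def cdhit_command(fasta, out_prefix):
    """Build a cd-hit command line (assumed settings, not odp's).

    -c 1.0 : cluster only at 100% identity
    -d 0   : write sequence names up to the first whitespace in .clstr
    """
    return ["cd-hit", "-i", fasta, "-o", out_prefix, "-c", "1.0", "-d", "0"]


def run_cdhit(fasta, out_prefix):
    """Run cd-hit (must be on PATH) and return the path of the .clstr file."""
    subprocess.run(cdhit_command(fasta, out_prefix), check=True)
    return out_prefix + ".clstr"


def parse_clstr(text):
    """Map each representative to the sequences cd-hit merged into it.

    Clusters that contain only their representative are skipped, so the
    result lists just the proteins with duplicates or contained substrings.
    """
    clusters = []
    for line in text.splitlines():
        line = line.strip()
        if line.startswith(">Cluster"):
            clusters.append({"rep": None, "members": []})
            continue
        match = MEMBER_RE.match(line)
        if match is None or not clusters:
            continue  # ignore blank or unexpected lines
        name, tag = match.group(2), match.group(3)
        if tag == "*":
            clusters[-1]["rep"] = name
        else:
            clusters[-1]["members"].append(name)
    return {
        c["rep"]: c["members"]
        for c in clusters
        if c["rep"] is not None and c["members"]
    }
```

usage would be something like `parse_clstr(open(run_cdhit("proteins.fasta", "proteins_nr")).read())`, where both file names are placeholders. Parsing the .clstr file rather than only using the filtered FASTA keeps a record of which protein each duplicate was collapsed into.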

conchoecia, Jul 05 '23 09:07