hh-suite
HHFilter -diff M parameter returns fewer sequences than M
Hi! I'm using HHFilter 3.3.0 to sample sequences from an MSA file that contains tens of thousands of homologous sequences.
The command is:
/userhome/anaconda3/envs/proteins/bin/hhfilter -i /userhome/data/TS2/msa/7KVT_B.a2m -o /userhome/data/TS2/7KVT_B_filter.a2m -id 90 -diff 512 -cov 0 -qid 0 -qsc -20.0
I set the -diff 512 parameter, which according to the help text should return 512 or more sequences that maximize diversity (the result is usually close to 512), but in fact I got fewer than 512 sequences.
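One way to check the number of sequences in the filtered file is to count its header lines. This is only a minimal sketch: it assumes the output is a standard A2M/FASTA file in which every sequence starts with a `>` line, and `count_sequences` is a hypothetical helper name, not part of HH-suite.

```python
def count_sequences(path):
    """Count records in an A2M/FASTA file by counting '>' header lines.

    Assumption: each sequence has exactly one header line starting with '>'.
    """
    with open(path) as handle:
        return sum(1 for line in handle if line.startswith(">"))
```

Calling `count_sequences("/userhome/data/TS2/7KVT_B_filter.a2m")` on the file written by the command above would give the number to compare against the -diff value. Note that the query sequence, if present in the output, is counted as well.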
I tried different values for the -diff parameter and got different results, which are shown in the following figure. The number of sequences returned looks odd to me. Is that normal?