GTDBTk
GTDBTk copied to clipboard
Subroutines for classify
I had some bad experiences with gtdb-tk classify which I run on large datasets.
-
For example, It passed all the steps but then failed because the symlinked files already existed.
-
It is relatively difficult to find optimal resources parameter to run the classify step because it involves pplacer + fastANI. Would it be possible to make two sub-routines for this step?
Hi, Thanks for the feedback We are looking to split the classify step into 2 different steps ( pplacer and ANI ). This should happen in the next release.
Thanks, Pierre