Martin Steinegger
I will also upload a list of protein IDs encoded on the short contaminated nucleotide contigs. I assume it would be useful for you to remove these as well.
@xgz-98 Conterminator automatically downloads the taxdump from the NCBI site. You only need to provide a FASTA file and the corresponding mapping from sequence identifier to taxid.
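In case it helps, here is a minimal sketch of how such a mapping file could be generated. It assumes the mapping is a plain two-column, tab-separated file (identifier, then taxid) and that the identifier is the first whitespace-delimited token of each FASTA header; please check both assumptions against the README. The helper names `fasta_identifiers` and `write_mapping` and the example records are made up for illustration and are not part of Conterminator.

```python
import io


def fasta_identifiers(handle):
    """Yield the identifier of each FASTA record.

    Assumption: the identifier is the first whitespace-delimited
    token after '>' on the header line.
    """
    for line in handle:
        if line.startswith(">"):
            fields = line[1:].split()
            if fields:
                yield fields[0]


def write_mapping(fasta_handle, taxid_by_id, out_handle):
    """Write one 'identifier<TAB>taxid' line per FASTA record.

    Returns the identifiers that have no taxid in taxid_by_id, so they
    can be fixed or dropped before running the tool.
    """
    missing = []
    for identifier in fasta_identifiers(fasta_handle):
        taxid = taxid_by_id.get(identifier)
        if taxid is None:
            missing.append(identifier)
            continue
        out_handle.write(f"{identifier}\t{taxid}\n")
    return missing


# Hypothetical in-memory example; real use would open files instead.
fasta = io.StringIO(">seq1 some description\nACGT\n>seq2\nGGCC\n>seq3\nTTAA\n")
out = io.StringIO()
# 9606 = Homo sapiens, 562 = Escherichia coli (NCBI taxids)
missing = write_mapping(fasta, {"seq1": 9606, "seq2": 562}, out)
```

Records without a known taxid are collected in `missing` rather than written with a placeholder, so incomplete mappings are visible before the run.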
@xgz-98 I have opened a separate issue: https://github.com/martin-steinegger/conterminator/issues/5
This might be solved by this pull request: https://github.com/emepyc/Blast2lca/pull/5