kaiju icon indicating copy to clipboard operation
kaiju copied to clipboard

Taxonomic Annotation from orfs

Open tpellegrinetti opened this issue 2 years ago • 1 comments

Hi all, Sorry if this question is silly.

I have several fasta files from some kegg orthogoly of my interest which I extracted from prodigal and hmmsearch result. For example, the nifH, that is the K02586.faa.

This file started as follow: ">L04SR_181+k119_40695_8 # 7004 # 8506 # 1 # ID=40466_8;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.457"

I would like to make a custom database from this fasta orf files and then make the taxonomic classification in metagenomic reads using Kaiju.

It's possible?

I tried to do this but it not works for me. Follow the script that I used:

kaiju-mkbwt -n 5 -a protein -o K16395 K16395.faa kaiju-mkfmi K16395 kaiju -t /Storage/databases/KaijuIndexes/nodes.dmp -f K16395.fmi -i /Storage/data1/thierry.pellegrinetti/05_metagenomas/2018/L04SR18_1.fasta -o L04SR18_1.out kaiju2table -t /Storage/databases/KaijuIndexes/nodes.dmp -n /Storage/databases/KaijuIndexes/names.dmp -r genus -o L04SR18_1_summary.tsv L04SR18_1.out

tpellegrinetti avatar May 17 '22 19:05 tpellegrinetti

When you want to make a custom database for kaiju, you need to follow the format described here: https://github.com/bioinformatics-centre/kaiju#custom-database The fasta-headers need to contain NCBI taxonomy IDs, that are also present in nodes.dmp and names.dmp from the NCBI taxdump files.

pmenzel avatar May 24 '22 19:05 pmenzel