Feature Request: taxonomy with blast databases
Hi!
I'm just trying out the new 'blast formatted databases' feature - or, rather, I was about to try it out. However, I see that taxonomy reporting is not supported for blast databases. I'd like to request this as a feature for the future!
Thanks,
Darren
Hi Darren, that's definitely going to happen, hopefully in the near future.
Any update on this feature?
It's on a long todo list, but I'll see what I can do.
Hi,
Is there any update on this, or suggestions for a workaround? I was hoping to use diamond to search a large dataset for bacteria-aligning sequences, and this is somewhat of a dead end...
You can use the diamond database format which supports taxonomic information.
Hi,
Thanks for the input, I did consider this. Unfortunately we're using the entire NCBI nr database so that we cover everything. Although I prepared it for use with diamond (prepdb), I don't have (and can't reasonably get) all of the raw sequences needed to build a diamond-format database from them. As such, I'm not sure that's an option, unless there's a pre-existing version of the NCBI nr database in diamond format somewhere!
You can download the nr sequences from NCBI (https://ftp.ncbi.nlm.nih.gov/blast/db/FASTA/), or extract them from the nr BLAST database you already have:
blastdbcmd -entry all -db nr -out nr.fasta
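For anyone following along, here is a rough sketch of how the whole workaround might fit together: extract the sequences, build a diamond-format database with taxonomy attached, then restrict a search to bacteria. It assumes the NCBI taxonomy files (prot.accession2taxid.FULL.gz, nodes.dmp, names.dmp) have already been downloaded into the working directory; reads.fasta and hits.tsv are hypothetical file names used only for illustration. The commands are printed rather than executed by default, since each step on the full nr takes a long time; set DRY_RUN=0 to actually run them.

```shell
#!/bin/sh
# Sketch only: file names other than "nr" are assumptions, adjust to your setup.
DRY_RUN="${DRY_RUN:-1}"   # 1 = print the commands, 0 = execute them

# Print the command in dry-run mode, otherwise run it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

build_nr_diamond_db() {
    # 1. Dump every sequence from the BLAST-formatted nr database to FASTA.
    run blastdbcmd -entry all -db nr -out nr.fasta

    # 2. Build a diamond-format database with taxonomy information.
    #    The three taxonomy files are assumed to be present already.
    run diamond makedb --in nr.fasta --db nr \
        --taxonmap prot.accession2taxid.FULL.gz \
        --taxonnodes nodes.dmp \
        --taxonnames names.dmp
}

search_bacteria() {
    # 3. Search only against bacterial sequences (NCBI taxon ID 2 = Bacteria).
    #    reads.fasta and hits.tsv are hypothetical input/output names.
    run diamond blastx --db nr --query reads.fasta --taxonlist 2 --out hits.tsv
}

build_nr_diamond_db
search_bacteria
```

Running the script as-is just prints the three commands so they can be checked before committing to the full run; for protein queries, blastp would replace blastx in the last step.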