dada2 icon indicating copy to clipboard operation
dada2 copied to clipboard

Issues with assignTaxonomy Function

Open userJ123 opened this issue 1 year ago • 4 comments

Hi, I was wondering if anyone could help me figure out why the assignTaxonomy function isn’t running in my code. I know it can sometimes take a while to run, but I’ve let it go for many hours and with the amount of samples it should really only be a few minutes.

I’ve confirmed that the pathway is correct on my computer, tried changing the “multithread=TRUE” statement to “FALSE” and excluding it altogether. I've confirmed that my seq.nochim file is created and accounted for. My R studio is version 4.1.3 and my dada2 package is version 1.22.0.

No error pops up, the code will just run eternally. If I try to manually stop the code, R Studio crashes.

taxa <- assignTaxonomy(seqtab.nochim, "/Users/J/Downloads/RNA-Seq/silva_nr99_v138.1_wSpecies_train_set.fa.gz")

userJ123 avatar Aug 09 '22 19:08 userJ123

How much memory is available on this machine? Running assignTaxonomy with the Silva db is fairly memory intensive, especially the wSpecies version.

benjjneb avatar Aug 09 '22 19:08 benjjneb

The physical memory of the machine is 8GB but it says that the "Memory Used" is 6.32GB. Is there a less memory-intensive version of Silva db that you recommend trying?

userJ123 avatar Aug 09 '22 20:08 userJ123

Try the non wSpecies version, and see if that helps. Test it on a small test dataset.

In my experience, running Silva on 8GB is not recommended. 16GB seems to be fine though.

benjjneb avatar Aug 09 '22 20:08 benjjneb

Thank you so much, the non wSpecies version worked!

userJ123 avatar Aug 10 '22 02:08 userJ123