dada2
dada2 copied to clipboard
Solving the "Error: BiocParallel errors" (in almost every sample)
Hi, I am an undergraduate studying bioinformatics in the microbiome analysis field, and I am having a BiocParallel Error every time I run my code for the ENA database datasets. The same code worked well for NCBI datasets, and I kept on debugging but couldn't solve the problem.
Following is the code I used:
library(dada2) library(ggplot2) library(dplyr)
path <- "/Users/User/Documents/SeniorThesis/zebraENA/reads_fastq/PRJNA806371" list.files(path)
#Assuming your forward and reverse fastq filenames have the format: SRRXXXXXXX_1.fastq and SRRXXXXXXX_2.fastq
fnFs <- sort(list.files(path, pattern="_1\.fastq.gz$", full.names = TRUE))
fnRs <- sort(list.files(path, pattern="2\.fastq.gz$", full.names = TRUE))
#Extract sample names, assuming filenames have the format: SRRXXXXXXX_X.fastq
sample.names <- sapply(strsplit(basename(fnFs), ""), [
, 1)
#INSPECT READ QUALITY PROFILES QPF <- plotQualityProfile(fnFs[1:2])
Error: BiocParallel errors 1 remote errors, element index: 1 0 unevaluated and other errors first remote error: Error in data.frame(sequence = names(freqtbl$top), count = as.integer(freqtbl$top), : arguments imply differing number of rows: 0, 1
I am getting this same error in almost every single public data I used.. Please give me comments how to solve this error. Thanks so much in advance!