sarek icon indicating copy to clipboard operation
sarek copied to clipboard

Wrong sample names in multiqc general table

Open LouisLeNezet opened this issue 1 year ago • 3 comments

Description of the bug

Hi, When using sarek for the mapping steps the names in the multiqc general statistics table are a bit strange image In the image above I think that 1-ADN13260_1.md and 1-ADN13260_1-lane1 should both be named 1-ADN13260_1 and that 1-ADN13260_1.1-ADN13260_1-lane1_1 should simply be ADN13260_1-lane1_1.

Command used and terminal output

nextflow run nf-core/sarek -profile singularity -r 3.4.4 --step "mapping" --skip_tools : "mosdepth,baserecalibrator" --outdir results --input file.csv

Relevant files

No response

System information

No response

LouisLeNezet avatar Sep 19 '24 08:09 LouisLeNezet

Could you share your file.csv with us?

asp8200 avatar Sep 19 '24 10:09 asp8200

The file just looked like:

patient,sample,lane,fastq_1,fastq_2
14123,1-ADN13364_1,lane1,R1.fastq.gz,R2.fastq.gz

LouisLeNezet avatar Sep 19 '24 11:09 LouisLeNezet

Hey! This is because the md files are spanning all input files belonging to a single sample. The Read QC on the other hand are computed for each fastq file separately. This should be massively simplified with the latest MultiQC release that allows grouping of QC stats that are split over several files. WE just need to add Fastp support for this in MultiQC itself

FriederikeHanssen avatar Sep 25 '24 09:09 FriederikeHanssen