MTBseq_source icon indicating copy to clipboard operation
MTBseq_source copied to clipboard

Results TBgroups

Open acebollada80 opened this issue 1 year ago • 1 comments

I'm currently facing an issue with the output files while using MTBseq version 1.0.3 via Docker for analyzing 39 different tuberculosis samples. Here's a breakdown of the steps I've taken:

Ran the command MTBseq --step TBfull --samples samples1202.txt --project Febrero. Executed various subsequent steps independently: MTBseq --step TBlist MTBseq --step TBvariants --snp_vars MTBseq --step TBstats MTBseq --step TBstrains MTBseq --step TBjoin --snp_vars --samples samples1202.txt --project Febrero MTBseq --step TBamend --samples samples1202.txt --project Febrero MTBseq --step TBgroups --samples samples1202.txt --project Febrero The issue arises in the output files of the TBGroups step. While the Strain_Classification file correctly classified the samples, I found discrepancies in the output files. Specifically:

The matrix file Febrero_joint_cf4_cr4_fr75_ph4_samples39_amended_u95_phylo_w12.matrix contains only 0's. In the file Febrero_joint_cf4_cr4_fr75_ph4_samples39_amended_u95_phylo_w12_d12.groups, all samples are assigned to "group1". Previously, I encountered similar results when executing MTBseq --step TBvariants. However, I suspected that this might be due to the detection of additional indels and gaps in some samples, leading me to perform the analysis solely with SNPs.

I would greatly appreciate any insights or suggestions on which step might be causing the issue or where to focus to identify the error. Your assistance would be invaluable in resolving this matter.

Thank you all for your time and support.

acebollada80 avatar Feb 15 '24 11:02 acebollada80