rnavar icon indicating copy to clipboard operation
rnavar copied to clipboard

Missing STAR index not created automaticaly

Open kunstner opened this issue 3 years ago • 5 comments

Description of the bug

Hi, I'm running rnavar v1.0.0 on GRCm38. Genomes and indices were downloaded using igenomes. I got troubles during the STAR alignment step. According to documentation, a new STAR index should automatically be created if the index version does not match the installed STAR version (https://github.com/nf-core/rnavar/pull/37#issue-1197002410). However, this process does not start and the workflow ends with an error

Error executing process > 'NFCORE_RNAVAR:RNAVAR:ALIGN_STAR:STAR_ALIGN (s9)'

Any ideas how I can fix this quickly so that the index gets created on the fly?

Many thanks, Axel

Command used and terminal output

nextflow run nf-core/rnavar -r 1.0.0 \
	-profile singularity \
	-resume \
	--igenomes_base ./references \
	--input samplesheet.csv --genome GRCm38 \
	--dbsnp references/Mus_musculus/Ensembl/GRCm38/Annotation/Variation/Mus_musculus.vcf.gz \
	--dbsnp_tbi references/Mus_musculus/Ensembl/GRCm38/Annotation/Variation/Mus_musculus.vcf.gz.tbi \
	--outdir ./results_single_end \
	--annotate_tools vep \
	--genome GRCm38 \
	--skip_variantannotation


Execution cancelled -- Finishing pending tasks before exit
-[nf-core/rnavar] Pipeline completed with errors-
Error executing process > 'NFCORE_RNAVAR:RNAVAR:ALIGN_STAR:STAR_ALIGN (s9)'

Caused by:
  Process `NFCORE_RNAVAR:RNAVAR:ALIGN_STAR:STAR_ALIGN (s9)` terminated with an error exit status (105)

Command executed:

  STAR \
      --genomeDir STARIndex \
      --readFilesIn GFI1_36S_S_MLL_AF9_3.fastq.gz  \
      --runThreadN 12 \
      --outFileNamePrefix s9. \
       \
      --sjdbGTFfile genes.gtf \
      --outSAMattrRGline ID:s9 'SM:s9' 'PL:illumina'  \
      --outSAMtype BAM SortedByCoordinate --readFilesCommand zcat  --sjdbOverhang 149 --twopassMode Basic  --outBAMsortingBinsN 50 --limitOutSJcollapsed 1000000
  
  
  
  if [ -f s9.Unmapped.out.mate1 ]; then
      mv s9.Unmapped.out.mate1 s9.unmapped_1.fastq
      gzip s9.unmapped_1.fastq
  fi
  if [ -f s9.Unmapped.out.mate2 ]; then
      mv s9.Unmapped.out.mate2 s9.unmapped_2.fastq
      gzip s9.unmapped_2.fastq
  fi
  
  cat <<-END_VERSIONS > versions.yml
  "NFCORE_RNAVAR:RNAVAR:ALIGN_STAR:STAR_ALIGN":
      star: $(STAR --version | sed -e "s/STAR_//g")
  END_VERSIONS

Command exit status:
  105

Command output:
  	STAR --genomeDir STARIndex --readFilesIn GFI1_36S_S_MLL_AF9_3.fastq.gz --runThreadN 12 --outFileNamePrefix s9. --sjdbGTFfile genes.gtf --outSAMattrRGline ID:s9 SM:s9 PL:illumina --outSAMtype BAM SortedByCoordinate --readFilesCommand zcat --sjdbOverhang 149 --twopassMode Basic --outBAMsortingBinsN 50 --limitOutSJcollapsed 1000000
  	STAR version: 2.7.9a   compiled: 2021-05-04T09:43:56-0400 vega:/home/dobin/data/STAR/STARcode/STAR.master/source
  Aug 08 09:51:40 ..... started STAR run
  Aug 08 09:51:41 ..... loading genome

Command error:
  WARNING: Ignoring /data/sb_service_01/ak_20220805_SomaticSignatures_174 bind mount: user bind control disabled by system administrator
  WARNING: Ignoring /home/kuenstner/.nextflow/assets/nf-core/rnavar/bin bind mount: user bind control disabled by system administrator
  WARNING: Ignoring /data/sb_service_01/ak_20220805_SomaticSignatures_174/work/dc/0bc41c5077de6d940513d659ce71d4 bind mount: user bind control disabled by system administrator
  WARNING: Not mounting current directory: user bind control is disabled by system administrator
  
  EXITING because of FATAL ERROR: Genome version: 20201 is INCOMPATIBLE with running STAR version: 2.7.9a
  SOLUTION: please re-generate genome from scratch with running version of STAR, or with version: 2.7.4a
  
  Aug 08 09:51:41 ...... FATAL ERROR, exiting

Work dir:
  /data/sb_service_01/ak_20220805_SomaticSignatures_174/work/dc/0bc41c5077de6d940513d659ce71d4

Tip: you can replicate the issue by changing to the process work dir and entering the command `bash .command.run`

Relevant files

No response

System information

No response

kunstner avatar Aug 08 '22 08:08 kunstner

@kunstner Thanks for reporting this. We are aware of this and purposefully skipped it as there were some technical challenges while implementing this check. We will certainly take this into account in the next patch release.

praveenraj2018 avatar Aug 08 '22 08:08 praveenraj2018

Hi @praveenraj2018

Has this been already fixed in the dev branch?

bounlu avatar Oct 11 '22 01:10 bounlu

got the same problem here @praveenraj2018. is there a suggested way around while you manage to implement a fix? maybe a star-indexed local reference?

lescai avatar Nov 21 '22 14:11 lescai

You can easily build the star index yourself. Just make sure you use the same version of STAR as RNAvar (e.g. STAR 2.7.9a).

The following command should do the job nicely:

STARINDEX=star_index fasta=$REFERENCE_DIR/Mus_musculus/Ensembl/GRCm38/Sequence/WholeGenomeFasta/genome.fa gtf=$REFERENCE_DIR/Mus_musculus/Ensembl/GRCm38/Annotation/Genes/genes.gtf

STAR --runMode genomeGenerate --genomeDir $STARINDEX --genomeFastaFiles $fasta --sjdbGTFfile $gtf --runThreadN 32

kunstner avatar Nov 22 '22 06:11 kunstner

The star parameter is still set in conf/igenomes.config, so the pipeline thinks a star index is provided.

Remove or comment the star = "${params.igenomes_base}... line, this will trigger the STAR_GENOMEGENERATE process to run.

WhoisDonlee avatar Jul 13 '23 14:07 WhoisDonlee