rna-seq-star-deseq2
rna-seq-star-deseq2 copied to clipboard
rule rseqc_gtf2bed failing with test dataset
Hi,
Thanks for all your work creating snakemake and putting together this pipeline!
I am running into an issue when trying to run the pipeline with the test data, particularly the rsec_gtf2bed rule. I am using snakemake version 5.7.0 (on a cluster). The error I get is pretty complicated and I am having trouble interpreting it. I think the first few error messages about the JSON Schema are unrelated. I have attached the output below of the command: snakemake --use-conda rseqc_gtf2bed
[user]$ snakemake --use-conda rseqc_gtf2bed No validator found for JSON Schema version identifier 'http://json-schema.org/draft-06/schema#' Defaulting to validator for JSON Schema version 'http://json-schema.org/draft-04/schema#' Note that schema file may not be validated correctly. No validator found for JSON Schema version identifier 'http://json-schema.org/draft-06/schema#' Defaulting to validator for JSON Schema version 'http://json-schema.org/draft-04/schema#' Note that schema file may not be validated correctly. No validator found for JSON Schema version identifier 'http://json-schema.org/draft-06/schema#' Defaulting to validator for JSON Schema version 'http://json-schema.org/draft-04/schema#' Note that schema file may not be validated correctly. Building DAG of jobs... Creating conda environment envs/gffutils.yaml... Downloading and installing remote packages. Environment for envs/gffutils.yaml created (location: .snakemake/conda/a2417d57) Using shell: /usr/bin/bash Provided cores: 1 Rules claiming more threads will be scaled down. Job counts: count jobs 1 rseqc_gtf2bed 1
[Tue Oct 15 14:25:51 2019] rule rseqc_gtf2bed: input: .test/data/ref/annotation.chr21.gtf output: qc/rseqc/annotation.bed, qc/rseqc/annotation.db log: logs/rseqc_gtf2bed.log jobid: 0
Activating conda environment: /data/project/rna-seq-star-deseq2-public-pipeline/.snakemake/conda/a2417d57
Traceback (most recent call last):
File "/data/project/rna-seq-star-deseq2-public-pipeline/.snakemake/scripts/tmp1qu6f74g.gtf2bed.py", line 3, in
RuleException: CalledProcessError in line 14 of /data/project/rna-seq-star-deseq2-public-pipeline/rules/qc.smk: Command 'source /share/apps/rc/software/Anaconda3/5.3.1/bin/activate '/data/project/rna-seq-star-deseq2-public-pipeline/.snakemake/conda/a2417d57'; set -euo pipefail; python /data/project/rna-seq-star-deseq2-public-pipeline/.snakemake/scripts/tmp1qu6f74g.gtf2bed.py' returned non-zero exit status 1. File "/data/project/rna-seq-star-deseq2-public-pipeline/rules/qc.smk", line 14, in __rule_rseqc_gtf2bed File "/share/apps/rc/software/Python/3.6.6-foss-2018b/lib/python3.6/concurrent/futures/thread.py", line 56, in run Shutting down, this might take some time. Exiting because a job execution failed. Look above for error message Complete log: /data/project/rna-seq-star-deseq2-public-pipeline/.snakemake/log/2019-10-15T142444.388725.snakemake.log
I apologize in advance if this is due to an error on my part but I welcome any insight you may have.
Thanks, Sam
I tried running the rule with a different version of snakemake (version 5.2.4) and it successfully completed. It appears to be an issue with that specific version of snakemake (version 5.7.0).
I'm hoping this has not reappeared with newer snakemake versions. Please feel free to reopen if this happens again.