galig icon indicating copy to clipboard operation
galig copied to clipboard

Need help to run the tool, no result for sample.mem

Open amrita1983 opened this issue 5 years ago • 13 comments

Hi, I am currently working on RNA seq data (Paired end), of human sample, I need to check the alternative splicing event of a specific gene "NEK1" in the sample data, and may be after that for all the genes, my sample has no replicate.I was trying to use ASGAL tool, but while I am giving the input for SpliceAwareAligner tools for generatinig sample.mem file form fastq files nothing is coming up as output. I am using the annotation gtf for that gene only extracted from hg19 human genome gtf. Here is the command I am usingbin/SpliceAwareAligner -g hg19.fa -a annotation_NEK1.gtf -s NEK1_NM1.fastq -o asgal/NEK1_NM1_output.memPlease help me to run the tool I need the data urgently, can you please help with this?

amrita1983 avatar Oct 04 '19 12:10 amrita1983

Hi, if the process ends without any error and the .mem file is empty, then there could be two reasons behind this behavior:

  • none of the input reads align to the splicing graph of that gene, maybe it is necessary to use different values of the parameters -l and -e or maybe most of the reads align to novel exons, that are currently not fully support by ASGAL
  • there is some problem in the input data. For example, the format of some input file (such as the GTF) is different from the one expected by ASGAL

If it is possible, can you please share your data with me? It would be easier for me to troubleshoot the problem.

Edit: are you using the version on the master branch?

ldenti avatar Oct 07 '19 07:10 ldenti

Hi, Thanks for the reply. I am sharing the data in fastq file reads (for the NEK1 gene sequences that I have extracted from alignment BAM file of the sample) which I have used in your tool as input for the program SpliceAwareAligner with the following annotation.gtf that I have attached whcih is having the coordinates of NEK1 gene exons, which I have extracted from human hg19. gtf. command I have used: /bin/SpliceAwareAligner -g hg19.fa -a annotation_NEK1.gtf -s NEK1_NM1.fastq -o asgal/NEK1_NM1_output.mem

python3 scripts/detectEvents.py -g hg19.fa -a annotation_NEK1.gtf -m asgal/NEK1_NM1_output.mem -o asgal/NEK1_NM1_output.events.csv I need to check the AS events for this gene NEK1 only for the time being. It would be a great help if you can test with these files. Thanks in advance NEK1_NM1.fastq.gz

annotation_NEK1.gtf.gz

amrita1983 avatar Oct 07 '19 10:10 amrita1983

Ok, I think I figured out what the problem is. If you directly use the SpliceAwareAligner executable, you should not use the full genome as reference (ie the hg19.fa) but you should use the sequence of the chromosome the gene belongs to. In our case, you should use this file. I tried with this file and I got a non-empty .mem file (NEK1_NM1.mem.tar.gz).

Let me know if this helps you

ldenti avatar Oct 07 '19 12:10 ldenti

Thanks for the reply and initiative. Sorry ,but I didn't see any output in that NEK1_NM1.mem.tar.gz file that you have attached. It has a file called aligns.mem which is empty :( I tried with the same chr4.fa after indexing it with samtools faidx, and used the same annotation_NEK1.gtf.gz file as gtf (attached earlier) but still that *.mem file is empty for me. Not getting what is wrong ??

amrita1983 avatar Oct 09 '19 06:10 amrita1983

Oh, I'm sorry... I attached the wrong file. This is the correct one.

In any case, it's very strange that you are still getting an empty .mem file. Can you please paste here the exact command you are running?

ldenti avatar Oct 09 '19 07:10 ldenti

Oh, thanks for the correct file. Here is the command that I am using /bin/SpliceAwareAligner -g chr4.fa -a annotation_NEK1.gtf -s NEK1_NM1.fastq -o asgal/NEK1_NM1_output.mem

I have tried the next command detectEvents.py using the .mem file that you have sent, there also I have got a lot of error messages !! (attaching file) error_msg_detectEvents.txt

python3 scripts/detectEvents.py -g chr4.fa -a annotation_NEK1.gtf -m asgal/NEK1_NM1.mem -o asgal/NEK1_NM1_output.events.csv

I hope I am using all the correct commands.

amrita1983 avatar Oct 09 '19 08:10 amrita1983

It's strange that you are still getting an empty .mem file. Does SpliceAwareAligner return any error?

Regarding the detectEvents.py script, if a file annotation_NEK1.gtf.db exists, try to remove it and rerun the script.

ldenti avatar Oct 09 '19 10:10 ldenti

No, SpliceAwareAligner did not return any error in nohup also. I have oe more sample to run with same command that you have used for SpliceAwareAligner.

For detectEvents now after removing gtf.db that .csv generated with headers line only. Here is one more sample file I have to run for the entire set.

NEK1_AF2.fastq.gz

amrita1983 avatar Oct 09 '19 11:10 amrita1983

This is the .mem file for the new sample. If the CSV is empty, it means that no events have been found. Maybe using a lower value for the parameter -l may change something

ldenti avatar Oct 09 '19 13:10 ldenti

Anyway, I have no idea why you obtain an empty .mem file. The command you use seems the right one... If I manage to understand why, I'll let you know

ldenti avatar Oct 09 '19 13:10 ldenti

Sure I will check to change the parameters, thanks for the help. Such a prompt help is really great help for my work. Please check if you can find some issue and let me know.

amrita1983 avatar Oct 10 '19 06:10 amrita1983

Can you please send me the command that you are using for SpliceAwareAligner, actually I want to see the command as well as the parameters I want ot change -l and -e but I cannot do it in my system still its not giving me any output

amrita1983 avatar Oct 10 '19 09:10 amrita1983

The command I used is:

./bin/SpliceAwareAligner -l 15 -g /data/amrita1983_git/chr4.fa -a /data/amrita1983_git/annotation_NEK1.gtf -s /data/amrita1983_git/NEK1_NM1.fastq -o /data/amrita1983_git/NEK1_NM1.mem

ldenti avatar Oct 12 '19 11:10 ldenti