starfish icon indicating copy to clipboard operation
starfish copied to clipboard

interpreting starfish insert error

Open metalichen opened this issue 9 months ago • 4 comments

Hi Emile,

Thanks for the great tool! I've been running it on a couple of genomes I'm working with. I followed your tutorial and was able to idenitify some tyr genes there. However, when I got to the element annotation step I got the following:

[Mon May 20 15:37:25 2024] executing command: starfish insert -T 2 -a analysis_and_temp_files/12_starship/ome2assembly.txt -d analysis_and_temp_files/12_starship/blastdb/macpha6.assemblies -b analysis_and_temp_files/12_starship/geneFinder/macpha6.tyr.bed -i tyr -x macpha6 -o analysis_and_temp_files/12_starship/elementFinder/
perl: warning: Setting locale failed.
perl: warning: Please check that your locale settings:
        LANGUAGE = (unset),
        LC_ALL = (unset),
        LANG = "en_GB.UTF-8"
    are supported and installed on your system.
perl: warning: Falling back to the standard locale ("C").
Key parameters:
--upstream             0-17000
--downstream           0-14000
--length               2000-700000
--pid                  90
--hsp                  1000
minDR                  4
maxDR                  40
maxEmptySiteLength     2000
maxElementLengthFlag   700000
minElementLengthFlag   2000
maxInsertCoverage      0.25
maxFlankCoverage       0.25
blastn                 -task dc-megablast -evalue 0.0001 -max_target_seqs 1000000
nucmer                 --mum
delta-filt             -m -l 1000 -i 90

[Mon May 20 15:37:25 2024] reading in data..
[Mon May 20 15:37:25 2024] parsing upstream regions of 11 candidate tyr captains..
[Mon May 20 15:37:25 2024] searching for hits to the upstream regions of 11 candidate tyr captains..
[Mon May 20 15:37:37 2024] found 4 tyr captains with candidate insertion sites
[Mon May 20 15:37:37 2024] searching for hits to the downstream regions of candidate insertion sites for 4 tyr captains that are also downstream of those captains..
[Mon May 20 15:37:42 2024] filtering out similar upstream and downstream flanks of insertion sites associated with 0 candidate tyr captains..
[Mon May 20 15:37:43 2024] refining boundary predictions of 0 candidate tyr captains with insertion sites..
BLAST options error: File analysis_and_temp_files/12_starship/elementFinder//emptyDB.macpha6.fa is empty


[Mon May 20 15:37:43 2024] error: could not execute makeblastdb on commandline, exiting..
Inappropriate ioctl for device

Does this mean that the 4 tyr genes that had upstream candidate insertion sites didn't have downstream sites, and that caused the error? If so, could it be because of the assembly fragmentation? One of the assemblies I'm working with isn't very contiguos (~60 contigs), so I thought it might be a problem here.

Thanks! Gulnara

metalichen avatar May 20 '24 15:05 metalichen