EDTA icon indicating copy to clipboard operation
EDTA copied to clipboard

Helitron results file has 0bp!

Open Toffeeladd opened this issue 8 months ago • 6 comments

Hi there, thank you for such a great software!

I am currently using EDTA to annotate TEs in two plant genomes assembled through PacBio hifi long reads. The repeat content of the two genomes estimated from redmask (https://github.com/nextgenusfs/redmask). I give some brief details of the genomes below:

Species 1 Genome size = 3.2gb Contigs = 13,882 Repeat content = 70.%

Species 2 Genome size = 2gb Contigs = 299 Repeat content = 68%

I have opted for the divide and conquer approach using EDTA_raw.pl (version=EDTA/2.0.1) due to time constraints on my HPC. It has so far worked well for finding TIR and LTR raw TEs in Species 2 however it is struggling to find Helitrons in both. I am inclined to believe that they exist in these genomes as closely related species from the same family (Rubiaceae) have them. Essentially I am unsure whether this is due to an error in helitron scanner or the sensitivity of the search. (I have relatively simple contig headers so I don't believe that to be the problem). It seems to fill some files in the directory but others are empty (the log file and empty files in respective directories are the same in both species so I have only provided one example below) Any help would be great thanks!

log_file:

Wed Oct 18 16:59:37 BST 2023 EDTA_raw: Check dependencies, prepare working directories.

Wed Oct 18 17:00:01 BST 2023 Start to find Helitron candidates.

Wed Oct 18 17:00:01 BST 2023 Identify Helitron candidates from scratch.

Error: Error while loading sequence perl make_bed_with_intact.pl EDTA.intact.fa > EDTA.intact.bed

Thu Oct 19 07:18:11 BST 2023 Warning: The Helitron result file has 0 bp!

Thu Oct 19 07:18:11 BST 2023 Execution of EDTA_raw.pl is finished!

Species2.fa.mod.EDTA.raw Directory:

4096 Oct 19 07:18 Helitron 4096 Oct 18 17:00 LTR 0 Oct 19 07:18 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.Helitron.intact.bed 0 Oct 19 07:18 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.Helitron.intact.fa 125 Oct 19 07:18 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.Helitron.intact.gff3 0 Oct 19 07:18 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.Helitron.raw.fa 4096 Oct 18 17:00 TIR

Helitron Directory:

62 Oct 18 17:00 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod -> ../../S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod 0 Oct 19 07:18 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.Helitron.intact.bed 0 Oct 19 07:18 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.Helitron.intact.fa 125 Oct 19 07:18 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.Helitron.intact.gff3 0 Oct 19 07:18 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.Helitron.raw.fa 20565256 Oct 18 21:42 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.draw.hel.fa 18463144 Oct 19 06:44 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.draw.rc.hel.fa 3115414 Oct 19 06:45 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.filtered.ext.fa 53965 Oct 19 07:18 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.filtered.ext.fa.cov0.9iden90.tabout 0 Oct 19 06:45 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.filtered.ext.fa.pass.fa 3095674 Oct 19 06:45 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.filtered.fa 0 Oct 19 07:18 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.filtered.fa.pass.fa 0 Oct 19 07:18 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.filtered.fa.pass.fa.dusted 0 Oct 19 07:18 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.filtered.fa.pass.fa.dusted.cleanup 0 Oct 19 07:18 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.filtered.fa.pass.fa.dusted.cln 62917 Oct 19 06:45 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.filtered.tabout 144237393 Oct 18 19:11 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.head 52260 Oct 18 21:42 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.pairends 36177959 Oct 19 06:45 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.raw.ext.fa 280115 Oct 19 06:45 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.raw.ext.list 144710515 Oct 18 23:29 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.rc.head 53142 Oct 19 06:44 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.rc.pairends 8971624 Oct 19 06:44 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.rc.tail 8898722 Oct 18 21:42 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.HelitronScanner.tail 20480 Oct 19 06:45 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.ndb 3596 Oct 19 06:45 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.not 16384 Oct 19 06:45 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.ntf 1200 Oct 19 06:45 S.wilk_nuclear_4n.asm.bp.p_ctg.fa.uncont.filtered.fa.mod.nto

Toffeeladd avatar Oct 27 '23 15:10 Toffeeladd