EDTA icon indicating copy to clipboard operation
EDTA copied to clipboard

Warning: LOC list - is empty / NO LTRs, SINES or LINES

Open keith-harrison opened this issue 9 months ago • 6 comments

Hi I am wondering if there is any error here. I have done TE predictions for like 8 genomes (around 20MB) in the same family and normally find a high number of TEs and find that this specific species does not have as good results as the rest.

/shared/EDTA/EDTA.pl --genome genome.fasta --overwrite 1 --sensitive 1 
--anno 1 --threads 6 --evaluate 1 --force 1

#########################################################
##### Extensive de-novo TE Annotator (EDTA) v2.2.0  #####
##### Shujun Ou ([email protected])             #####
#########################################################


Parameters: --genome genome.fasta --overwrite 1 --sensitive 1 --anno 1 --threads 6 --evaluate 1 --force 1


Wed  5 Mar 15:14:07 GMT 2025    Dependency checking:
                                All passed!

Wed  5 Mar 15:14:08 GMT 2025    Obtain raw TE libraries using various structure-based programs: 
Wed  5 Mar 15:14:08 GMT 2025    EDTA_raw: Check dependencies, prepare working directories.

Wed  5 Mar 15:14:08 GMT 2025    Start to find LTR candidates.

Wed  5 Mar 15:14:08 GMT 2025    Identify LTR retrotransposon candidates from scratch.

awk: fatal: cannot open file `genome.fasta.mod.pass.list' for reading: No such file or directory
Warning: LOC list - is empty.

Error: Error while loading sequence
Filter sequence based on TEsorter classifications. Unclassified sequences will also be output to the clean file.
        Usage: perl cleanup_misclas.pl sequence.fa.rexdb.cls.tsv
        Author: Shujun Ou ([email protected]) 10/11/2019

mv: cannot stat 'genome.fasta.mod.LTR.intact.fa.ori.dusted.cln.cln': No such file or directory
mv: cannot stat 'genome.fasta.mod.LTR.intact.fa.ori.dusted.cln.cln.list': No such file or directory
cp: cannot stat 'genome.fasta.mod.LTR.intact.raw.fa.anno.list': No such file or directory
ERROR: No such file or directory at /shared/EDTA/util/output_by_list.pl line 39.

        perl filter_gff3.pl file.gff3 file.list > new.gff3

Wed  5 Mar 15:14:20 GMT 2025    Warning: The LTR result file has 0 bp!

Wed  5 Mar 15:14:20 GMT 2025    Start to find SINE candidates.
.... Similar results follow from this.

Thanks, Keith

keith-harrison avatar Mar 05 '25 15:03 keith-harrison

I'm getting nearly the same error using version v2.2.2 (quay.io-biocontainers-edta-2.2.2--hdfd78af_1 image via apptainer). My error includes a cds input and has Error: cd-hit-est is not found in the CDHIT path ! even though the dependency check states All passed!. @keith-harrison are you using a containerized version of EDTA, by chance?

ryandkuster avatar Mar 12 '25 13:03 ryandkuster

@oushujun Hi, I have also installed version v2.2.2 and am using a singularity container (with docker://quay.io/biocontainers/edta:2.2.2--hdfd78af_1). I have the same error as well: Error: cd-hit-est is not found in the CDHIT path !

asthaamishra avatar Mar 12 '25 15:03 asthaamishra

No I use the conda/mamba version which works well for me (redownloaded last week to get the new version finally 👍 ). Also the error I am having has nothing to do with the error you are having since they occur independently, mine is just down to no LTRs being found in some of my genomes :) try to make a new github issue just so its clear for the author of the tool.

keith-harrison avatar Mar 12 '25 16:03 keith-harrison

To answer to Keith's original question, if the genome is this small (20Mb), you may want to consider the possibility that it's missing some types of TEs. If you believe the program runs correctly, you may use --force 1 to bypass the missing types.

For other apptainer questions - I am very unfamilar with singularity/apptainer, sorry. I would love to know the solution if you find out, or ways to improve EDTA or its conda recipe to solve this issue.

Thanks! Shujun

oushujun avatar Mar 20 '25 12:03 oushujun

The cd-hit-est error is cause by LTR_retriever#L282. The which command is not available in the container, at least for singularity. As a stopgap, I'm binding the host's which executable inside the container by adding -B /usr/bin/which to the singularity command:

export PYTHONNOUSERSITE=1
singularity exec -B /usr/bin/which {path}/EDTA.sif EDTA.pl --genome genome.fa [other parameters]

The run hasn't completed yet; however, it has finished finding LTR candidates without any issues.

Adding which (available in the conda-forge repo) to the dependencies should fix it

maa146 avatar Apr 03 '25 13:04 maa146

Thank you @maa146 for this fix! Binding which executable inside the container helped.

asthaamishra avatar Apr 17 '25 15:04 asthaamishra