alphafold
An error when running AlphaFold, asking for help online
CE8eSpRY.fasta
Error: I0910 11:42:53.106589 139985805367104 hhsearch.py:76] Launching subprocess "hhsearch -i /tmp/tmpd42j2lo8/query.a3m -o /tmp/tmpd42j2lo8/output.hhr -maxseq 1000000 -d /mnt/pdb70/pdb70"
I0910 11:42:53.259523 139985805367104 utils.py:36] Started HHsearch query
I0910 11:42:53.617978 139985805367104 utils.py:40] Finished HHsearch query in 0.358 seconds
Traceback (most recent call last):
File "run_alphafold.py", line 338, in
stderr:
- 11:42:53.447 INFO: /tmp/tmpd42j2lo8/query.a3m is in A2M, A3M or FASTA format
- 11:42:53.447 WARNING: Ignoring invalid symbol '*' at pos. 1623 in line 2 of /tmp/tmpd42j2lo8/query.a3m
- 11:42:53.610 ERROR: [subseq from] CRISPR-associated endonuclease Cas9/Csn1 n=212 Tax=root TaxID=1 RepID=CAS9_STRP1
- 11:42:53.610 ERROR: Error in /opt/conda/conda-bld/hhsuite_1616660820288/work/src/hhalignment.cpp:1244: Compress:
- 11:42:53.610 ERROR: sequences in /tmp/tmpd42j2lo8/query.a3m do not all have the same number of columns,
- 11:42:53.610 ERROR: e.g. first sequence and sequence UniRef90_Q99ZW2/2-1048.
- 11:42:53.610 ERROR: Check input format for '-M a2m' option and consider using '-M first' or '-M 50'
I got the same error.
11:42:53.447 WARNING: Ignoring invalid symbol '*' at pos. 1623 in line 2 of /tmp/tmpd42j2lo8/query.a3m
Don't you have '*' at the end of the query sequence?
Yes, I have this * at the end of the protein sequence, and I am facing the identical error. I will run it again without the *.
OK! Unbelievably, the so-called "WARNING" is what was causing the error! Just removing the asterisk from the end of my sequence resolved my problem.
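For anyone else hitting this, here is a minimal sketch of cleaning the input before running AlphaFold. It is not part of AlphaFold itself: `strip_stop_symbols` is a hypothetical helper name, and the sample record is made up for illustration. It only removes `*` from sequence lines and leaves the `>` header lines untouched.

```python
def strip_stop_symbols(fasta_text: str) -> str:
    """Return FASTA text with every '*' removed from sequence lines.

    Hypothetical helper, not an AlphaFold or HH-suite function.
    Header lines (starting with '>') are kept exactly as they are.
    """
    cleaned = []
    for line in fasta_text.splitlines():
        if line.startswith(">"):
            # Header line: keep as-is, even if it contains '*'.
            cleaned.append(line)
        else:
            # Sequence line: drop stop-codon markers and stray whitespace.
            cleaned.append(line.replace("*", "").strip())
    return "\n".join(cleaned) + "\n"


if __name__ == "__main__":
    # Made-up example record with a trailing '*', as translation tools often emit.
    example = ">query\nMDKKYSIGLDIGT*\n"
    print(strip_stop_symbols(example), end="")
```

To apply it to a real file, read the FASTA text, pass it through the function, and write the result to a new file that you then give to `run_alphafold.py`.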