snp-sites icon indicating copy to clipboard operation
snp-sites copied to clipboard

error: alignment contains sequences of unequal length

Open rderelle opened this issue 3 years ago • 8 comments

Hi,

I have generated an alignment between 2 genomes using progressiveMauve (default parameters) and I'm now trying to extract SNPs using snp-sites.

My issue is that I get the error message: '' Alignment my_ali contains sequences of unequal length. Expected length is 42875 but got 42876 in sequence ''

However both sequences have a length of 42876, but both sequences have 1 indel '-'.

any idea about how to fix that ?

thanks Romain

rderelle avatar Feb 21 '21 09:02 rderelle

I'm getting the same error while trying to extract SNPs from MAFFT alignments of coronavirus, courtesy of GISAID

(pangenomics) ubuntu@ip-172-31-29-49:~/covid19/vcf$ snp-sites -v -o GISAID0201 ../gisaid_hcov-19_2021_01_30_08.fasta.gz
Alignment ../gisaid_hcov-19_2021_01_30_08.fasta.gz contains sequences of unequal length. Expected length is 29766 but got 29768 in sequence hCoV-19/Japan/PG-9988/2020|EPI_ISL_902711|2020-07

Also, wasn't there an option to output a VCF per sample?

dbrami avatar Feb 23 '21 14:02 dbrami

@rderelle @dbrami got any solution for this? any other tool or something?

Logan1x avatar Mar 24 '21 16:03 Logan1x

@Logan1x I have abandoned Mauve, which doesn't seem to work well. It is actually easy to get a list of SNPs from Minimap2 (using third party pgm provided in the same package) or GSalign (directly from the pgm).

cheers

rderelle avatar Mar 24 '21 16:03 rderelle

Sanger iInstitute doesn’t support this software and its the bug fix anymore and original authors left so no issue resolution in sight.

On Wed, Mar 24, 2021 at 9:08 AM romain derelle @.***> wrote:

@Logan1x https://github.com/Logan1x I have abandoned Mauve, which doesn't seem to work well. It is however easy to get a list of SNPs from Minimap2 (using third party pgm provided in the same package) or GSalign (directly from the pgm).

cheers

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/sanger-pathogens/snp-sites/issues/101#issuecomment-805956254, or unsubscribe https://github.com/notifications/unsubscribe-auth/AABH7W5ROJKPKAB2OU7SJNLTFIFAHANCNFSM4X6Y6E5Q .

-- Sent from Gmail Mobile

dbrami avatar Mar 25 '21 04:03 dbrami

fxxk!

Albertzhuo123 avatar Apr 04 '22 06:04 Albertzhuo123

Hi, I'm wondering if folks might recommend a similar tool to extract SNPs from a multiple sequence alignment to generate an alignment of SNPs only? I am similarly running into errors with Expected length although I have confirmed that sequences are the same length.

Thank you in advance!

ksw9 avatar May 10 '22 18:05 ksw9

Hi,

Did anybody find solution?

Thanks

ankeetkumar avatar Mar 19 '23 05:03 ankeetkumar