stringdecomposer
stringdecomposer copied to clipboard
SD--tsv
Dear author
I'd like to ask you,I run through your code ,generates 3 tsv suffix files,as follow:
I'd like to ask which file I should mainly look at. What I' m looking at now is final_decomposition.tsv, This column of the document
Dear author
Can you explain this sentence,I can not understand.why (i+1,j)represent insertion, (j+1,i)represent deletions, and so on
Hi,
Thank you for your interest in String Decomposer! The final output of the tool is at final_decomposition.tsv
.
Wrt your second question --- this is just how the graph is defined. It is pretty much analogous to the matrix alignment of two sequences (see for example, https://en.wikipedia.org/wiki/Needleman%E2%80%93Wunsch_algorithm).
Thanks, Andrey
Dear author
What I want to ask is that in this final_decomposition.tsv,I'm mainly looking at which column to see it structural variation?
Is it the third column with letters?
Looking forward to your reply
Best
Hi!
Thank you again for your interest in StringDecomposer. File final_decomposition.tsv has the following columns (from left to right):
- Sequence name (usually read or assembly)
- Best aligned monomer name (it has ' at the end if the alignment is reverse complement)
- Alignment start position on sequence
- Alignment end position on sequence
- Alignment identity score
- Second best aligned monomer name
- Second best aligned monomer identity score
- Best aligned monomer name, if homopolymers collapsed (like GGGG -> G) in both sequences.
- Alignment score for the best monomer with collapsed homopolymers.
- Second best aligned monomer with collapsed homopolymers.
- And its score.
Sorry for late response!
Thank you, Tanya