stringdecomposer icon indicating copy to clipboard operation
stringdecomposer copied to clipboard

SD--tsv

Open duhuipeng opened this issue 4 years ago • 4 comments

Dear author I'd like to ask you,I run through your code ,generates 3 tsv suffix files,as follow: image I'd like to ask which file I should mainly look at. What I' m looking at now is final_decomposition.tsv, This column of the document ,Because I want to predict SV now, I want to know if I can do it Looking forward to your reply

duhuipeng avatar Oct 28 '20 03:10 duhuipeng

Dear author Can you explain this sentence,I can not understand.why (i+1,j)represent insertion, (j+1,i)represent deletions, and so on image

duhuipeng avatar Oct 28 '20 15:10 duhuipeng

Hi,

Thank you for your interest in String Decomposer! The final output of the tool is at final_decomposition.tsv.

Wrt your second question --- this is just how the graph is defined. It is pretty much analogous to the matrix alignment of two sequences (see for example, https://en.wikipedia.org/wiki/Needleman%E2%80%93Wunsch_algorithm).

Thanks, Andrey

seryrzu avatar Oct 29 '20 20:10 seryrzu

Dear author image image What I want to ask is that in this final_decomposition.tsv,I'm mainly looking at which column to see it structural variation? Is it the third column with letters? Looking forward to your reply Best

duhuipeng avatar Nov 04 '20 03:11 duhuipeng

Hi!

Thank you again for your interest in StringDecomposer. File final_decomposition.tsv has the following columns (from left to right):

  1. Sequence name (usually read or assembly)
  2. Best aligned monomer name (it has ' at the end if the alignment is reverse complement)
  3. Alignment start position on sequence
  4. Alignment end position on sequence
  5. Alignment identity score
  6. Second best aligned monomer name
  7. Second best aligned monomer identity score
  8. Best aligned monomer name, if homopolymers collapsed (like GGGG -> G) in both sequences.
  9. Alignment score for the best monomer with collapsed homopolymers.
  10. Second best aligned monomer with collapsed homopolymers.
  11. And its score.

Sorry for late response!

Thank you, Tanya

TanyaDvorkina avatar Nov 11 '20 18:11 TanyaDvorkina