Add Aligned Query Seq/Subject Seq to M8 output?
Would it be possible to add an option to --outfmt (for .m8/.m9 output file extensions) for searchp and searchn to return the aligned portions of the query sequence/subject sequence?
Thank you
In theory that's possible, however development focus has switched to making the new version of lambda: lambda3.
I will see how much work this feature would be for lambda2, though.
Understood. Thank you!
Hello,
Following up on the above item to see if it's in development for Lambda3. We're processing large amounts of data in pairwise format because we need the aligned portions of the query/subject sequences (including gaps when applicable) - if it would be possible to include these in the tabular output in Lambda3 it would be awesome!
We would need to get changes into SeqAn2 for this. I am not sure that is still possible, but I will investigate!
Hey @h-2 - saw some activity on this repo! Are there any updates on when LAMBDA3 will be released?
Hi! Yes, there is quite a bit of development and many improvements and new features on the horizon. I don't have an exact date, yet, but the protein mode on the lambda3 branch is already fairly stable.
Regarding this feature request: We will still be using the SeqAn2 code for writing the output files. This means that we will not be able to do major changes to the Blast output, unfortunately.
I currently see the following options:
- Use
.m0and parse the strings from that. Not ideal, because it is an ugly format, but possible. - Use the positions printed in
.m8and extract manually. I guess, you are doing something similar right now, but it is not very ergonomic of course. - Use SAM or BAM output. They currently already offer printing the matching query sequence (SEQ field / qs field) + the hard-clip option. Interfaces for this are more modular, and we would be able to add an option for also printing the subject sequence if this is important to you!