
Add 'existing paired alignment' to complex pipeline

Open njrollins opened this issue 5 years ago • 2 comments

For some applications, such as protein-RNA complexes, I'd like to pair sequences with a custom method and feed the already-paired sequences into the pipeline (two alignments, where sequence X in alignment 1 is paired with sequence X in alignment 2).

This means skipping the 'genome distance' or 'best hit' step in the concatenation pipeline.
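
To make the request concrete, here is a minimal sketch of what "already paired" input could mean: two FASTA alignments with the same number of sequences, paired by position and concatenated directly. It uses only the Python standard library. The function names (`read_fasta`, `concatenate_paired`), the example identifiers, and the `_` separator in the combined identifier are assumptions for illustration and are not part of EVcouplings.

```python
from io import StringIO


def read_fasta(handle):
    """Return a list of (identifier, sequence) tuples in file order."""
    records, header, chunks = [], None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Flush the previous record before starting a new one
            if header is not None:
                records.append((header, "".join(chunks)))
            fields = line[1:].split()
            header, chunks = (fields[0] if fields else ""), []
        else:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def concatenate_paired(ali_1, ali_2):
    """Concatenate sequence i of ali_1 with sequence i of ali_2 (hypothetical helper)."""
    if len(ali_1) != len(ali_2):
        raise ValueError("paired alignments must contain the same number of sequences")
    return [
        (f"{id_1}_{id_2}", seq_1 + seq_2)
        for (id_1, seq_1), (id_2, seq_2) in zip(ali_1, ali_2)
    ]


# Toy alignments; identifiers and sequences are made up
protein = read_fasta(StringIO(">protA_1\nMK-L\n>protA_2\nMKAL\n"))
rna = read_fasta(StringIO(">rnaB_1\nACGU\n>rnaB_2\nAC-U\n"))
paired = concatenate_paired(protein, rna)
```

Pairing by position keeps the sketch simple; pairing by a shared identifier would work equally well if the two files are not in the same order.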

njrollins, Aug 07 '20 20:08

Yep, definitely a good idea. Pinging @aggreen re the protein-protein pipeline.

For protein-RNA, besides the obvious hack of putting the run through the protein complex pipeline, we will need to think this through in more detail, and maybe finally create the appropriate protocols and pipelines...

thomashopf, Aug 08 '20 20:08

Yeah, the hack version of doing this now is to use the best hit pipeline: select 'input existing alignment' for both of the monomer stages, and provide an input annotation file that has a unique identifier for each pair of sequences that you want to pair. Best hit will then just pair up all the sequences with the same annotation. But I agree it deserves a formal solution.
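
As a rough sketch of the annotation part of this workaround, the snippet below writes one annotation table per monomer in which sequence i of each alignment gets the same pair identifier. The column names (`id`, `pair_id`), the `pair_` prefix, and the helper name `write_pair_annotation` are hypothetical; the actual column layout the pipeline expects in an annotation file is not specified in this thread and would need to be checked against the EVcouplings documentation.

```python
import csv
from io import StringIO


def write_pair_annotation(sequence_ids, handle, prefix="pair"):
    """Write a CSV mapping each sequence ID to a positional pair identifier.

    Column names are assumptions for illustration, not the EVcouplings format.
    """
    writer = csv.writer(handle)
    writer.writerow(["id", "pair_id"])
    for index, sequence_id in enumerate(sequence_ids):
        writer.writerow([sequence_id, f"{prefix}_{index}"])


# Sequence IDs in paired order (made-up identifiers)
protein_ids = ["protA_1", "protA_2"]
rna_ids = ["rnaB_1", "rnaB_2"]

protein_table, rna_table = StringIO(), StringIO()
write_pair_annotation(protein_ids, protein_table)
write_pair_annotation(rna_ids, rna_table)

# Read the tables back to confirm that row i in both files shares a pair_id
protein_rows = list(csv.reader(StringIO(protein_table.getvalue())))
rna_rows = list(csv.reader(StringIO(rna_table.getvalue())))
```

Because each identifier occurs exactly once per monomer, a step that groups sequences by matching annotation has only one candidate per group, which is what makes the pairing deterministic.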

aggreen, Aug 08 '20 21:08