diamond icon indicating copy to clipboard operation
diamond copied to clipboard

About the '--masking 0' option

Open ZhangBioLab opened this issue 1 year ago • 3 comments

Hi, I'm curious about why, despite setting --masking 0, the sseq output still contains many results with 'XXXXXXXX'. like: AERLDEVAAQRHCLTDRFHGGGQGGIGTGELLEREPXRLDHHVVQGGFETGRRFPCDVVDDLVEGVTDGQFGGDLGDRKAGRLGRQXXGTRHXRVHLDDDQPTVARVDRELDVAAAGVHTHLAQDRDAQVAHPLVFXVGQRHRXXXXXXXXXXHTHRVDVFDRAHHHHVVVAVAHQLEFEFLPAVNRFLDEHVGAGR-GRQPXXXXXXXXVGGVRYPRTQPAHGEARPXXXXXXXXXDRLTHFGXGETHSAPGGFATGLGXDVLEPLPVLAXLDGVXXXADEFHAVLFQHPALVQRDRGVQRGLPTQGRQQGVDLVAPLGLLGDNPLHERRGDGLYVGVVGELRVGHDGGRIRVHQADLQALGAQHPARLSPXVVELARLADDDRPGXXDQHVVXIGATGH Thank you!

ZhangBioLab avatar Feb 08 '24 08:02 ZhangBioLab

I could not reproduce the issue when using --masking 0. Please double check that your input sequences don't already contain the X.

bbuchfink avatar Feb 13 '24 10:02 bbuchfink

Sorry, I just saw your reply now. The amino acid in the sequence I entered was originally U, but Diamond was automatically replaced with X, it looks like this: VERFLEGSADGHRLAHGLHRSGKEILRPGKLFKREPGHFHHAVIDGGLERSPGLPGDVVGDLVQGIPHGQLGGDLGNGKPRRLGCEGRXPGDPGVHLYDDHLPVGGVDGELDVGPPRLHADFPQNRNRGVPQQLIFPVGQGLGRSHRDRIPRMDAHGVHVLDGADDDHIVHAVAHDLELEFLPAEHRLL-EHDGVNETGIQPALGQFLQFFPVVGHAAPRAAQRERRPHDDRETDLPGNGFHFRHGTRNAAGRNAQPDPLHGIAEQFPVFGFLDDFNTRSDESHAETFEHTRFGHAHRHVQGRLPAQGGQQRVGTL-PL----DHLRHRFGRDRLDIGAVGRFRIGHYCCGVAVDQDNLVPFLAQCLAGLGPGVVELARLADDDGAGSDDQYLSYVGSLGH

ZhangBioLab avatar Feb 27 '24 14:02 ZhangBioLab