
Data pre-processing pipeline

Open: amoskalev opened this issue 1 year ago • 1 comment

Hi, thanks for your work!

Could you advise me on the data pre-processing pipeline that you used? Which OAS columns did you use to transcribe and translate the DNA sequences into antibody sequences? How did you implement the transcription? In general, how did you get from the raw OAS data to the data used in the paper? Thanks!

Would it be possible to release the data-processing pipeline?

amoskalev, Oct 25 '23 11:10

I have the same question. The paper mentions that 40% of OAS sequences are missing residues at the N-terminus. Were they filtered out or were the missing amino acids masked as unknown?
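To make the two options concrete, here is a minimal sketch of what I mean. Everything in it is hypothetical: the function name, the `*` mask character, and the idea that the number of missing N-terminal residues is already known per sequence are my assumptions, not anything taken from the AbLang code or paper.

```python
from typing import Optional

# Hypothetical placeholder for an unknown residue; the token actually
# used for the paper's data (if any) is not known to me.
MASK = "*"


def handle_n_terminus(seq: str, n_missing: int, strategy: str) -> Optional[str]:
    """Illustrate the two options asked about above.

    seq        -- amino-acid sequence as observed (N-terminus possibly truncated)
    n_missing  -- assumed-known count of residues missing at the N-terminus
    strategy   -- "filter": drop truncated sequences entirely
                  "mask":   keep them, padding the gap with MASK characters
    """
    if n_missing < 0:
        raise ValueError("n_missing must be non-negative")
    if strategy == "filter":
        # Option 1: truncated sequences are removed from the dataset.
        return seq if n_missing == 0 else None
    if strategy == "mask":
        # Option 2: the missing positions are kept but marked as unknown.
        return MASK * n_missing + seq
    raise ValueError(f"unknown strategy: {strategy!r}")
```

With `strategy="filter"` a sequence missing residues is dropped (returns `None`), while with `strategy="mask"` it is kept and left-padded with the placeholder. Which of these (or something else) matches what was done for the paper?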

tsjain, Dec 07 '23 18:12