AbLang
AbLang copied to clipboard
Data pre-processing pipeline
Hi, thanks for your work!
Can advise me the on data pre-processing pipeline that you used? Which OAS columns did you use to transcribe and translate DNAs to antibodies? How did you implement the transcription? In general the question is how did you go from raw OAS data to the data used in the paper. Thanks!
Would it be possible to release data-processing pipeline?
I have the same question. The paper mentions that 40% of OAS sequences are missing residues at the N-terminus. Were they filtered out or were the missing amino acids masked as unknown?