Checking similarity of input ligand w.r.t. the training set

Open zhang-ivy opened this issue 10 months ago • 0 comments

As described in the preprint, one of the criteria for constructing the validation set is "at least one of the small-molecules exhibits a tanimoto similarity of 0.8 or less to any small-molecule in the training set." Would it be possible to share your workflow (the code and the training set ligands) for computing small-molecule similarity to the training set? I would like to check how similar my input ligand is to the training set ligands.
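For context, here is a minimal sketch of the kind of check I have in mind. It is not your workflow, only my guess at its shape. It assumes each ligand has already been converted to a fingerprint represented as a set of on-bit indices (for example by a cheminformatics toolkit; the preprint quote does not say which fingerprint type is used). The ligand names and bit sets below are made-up toy data, and reading the criterion as "maximum similarity to the training set is 0.8 or less" is my interpretation.

```python
from typing import Dict, Optional, Set, Tuple

# Threshold taken from the preprint quote; the comparison direction is my reading.
SIMILARITY_THRESHOLD = 0.8


def tanimoto(a: Set[int], b: Set[int]) -> float:
    """Tanimoto similarity between two fingerprints given as sets of on-bit indices."""
    if not a and not b:
        return 0.0  # convention chosen here for two empty fingerprints
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def nearest_training_ligand(
    query: Set[int], training: Dict[str, Set[int]]
) -> Tuple[Optional[str], float]:
    """Return the name and similarity of the training ligand most similar to the query."""
    best_name: Optional[str] = None
    best_score = -1.0
    for name, fingerprint in training.items():
        score = tanimoto(query, fingerprint)
        if score > best_score:
            best_name, best_score = name, score
    return best_name, best_score


# Hypothetical toy fingerprints, not real ligands from any training set.
training_set = {
    "lig_a": {1, 4, 7, 9, 12},
    "lig_b": {2, 3, 5},
    "lig_c": {1, 4},
}
query_ligand = {1, 4, 7, 9}

nearest_name, max_similarity = nearest_training_ligand(query_ligand, training_set)
passes_criterion = max_similarity <= SIMILARITY_THRESHOLD
print(nearest_name, max_similarity, passes_criterion)
```

With the actual training set ligands and the fingerprint settings you used, I could swap in real fingerprints and run the same comparison for my input ligand.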

Apr 15 '25 20:04 zhang-ivy