moleculenet
SIDER dataset: numbers of samples disagree
Hello,
The MoleculeNet documentation states that the SIDER dataset contains 1427 drugs, but the original SIDER paper reports 1430 drugs. I cannot find an explanation for this discrepancy. Can someone help me understand it? Thanks!
Good question! If I had to guess offhand, perhaps RDKit fails to process a few of the molecules? I'm not entirely sure.
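One way to test that guess is to run every SMILES string from the raw SIDER file through RDKit and count the ones it rejects. Below is a minimal sketch, not a confirmed explanation of the 1430 vs. 1427 gap. It assumes RDKit is installed and relies on `Chem.MolFromSmiles` returning `None` when parsing fails; `find_unparseable` and `rdkit_parse` are hypothetical helper names, and loading the SMILES column from the raw file is left to you.

```python
from typing import Callable, Iterable, List, Optional


def find_unparseable(smiles_list: Iterable[str],
                     parse: Callable[[str], Optional[object]]) -> List[str]:
    """Return the SMILES strings for which `parse` returns None."""
    return [s for s in smiles_list if parse(s) is None]


def rdkit_parse(smiles: str) -> Optional[object]:
    """Parse with RDKit (assumed installed); returns None on failure."""
    from rdkit import Chem  # imported lazily so the helper above has no hard dependency
    return Chem.MolFromSmiles(smiles)


# Usage (hypothetical): `smiles_column` is the list of SMILES strings
# read from the raw SIDER file.
# failed = find_unparseable(smiles_column, rdkit_parse)
# print(f"{len(failed)} SMILES failed to parse: {failed}")
```

If the number of failures matches the three missing drugs, that would support the RDKit hypothesis; if it is zero, the difference likely comes from somewhere else, such as deduplication or filtering during dataset preparation.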