pretrain-gnns icon indicating copy to clipboard operation
pretrain-gnns copied to clipboard

Question about more datasets and more baselines

Open Dinxin opened this issue 4 years ago • 2 comments

Why don't you give the associated experimental results on some regression tasks in the MoleculeNet benchmark? Additionally, there are some pertaining techniques tailored for molecule data such as mol2vec, seq2seq fingerprint. However, I don't see the model comparison.

Dinxin avatar May 25 '20 04:05 Dinxin

Great question! In this paper, we focus on GNNs. Testing all those techniques is beyond the scope of a single paper, but it is definitely an important work to do.

weihua916 avatar May 31 '20 00:05 weihua916

Given that you do a "graph-level multi-task supervised pre-training" on the preprocessed ChEMBL dataset from the publication "Large-scale comparison of machine learning methods for drug target prediction on ChEMBL" did you perhaps also check whether your GNN outperforms the multi-task feedforward neural network from this publication on this set (note that the best AUC ROC results for a random split were reported by the authors when using additional data normalization in http://www.bioinf.jku.at/research/lsc/report.pdf)?

badtom avatar Jul 14 '20 15:07 badtom