Dataset Description

Open Tianqi-py opened this issue 1 year ago • 6 comments

Hi there,

I was analyzing the SIDER graph dataset used in this paper and had difficulty understanding how the adjacency (adj) matrix is used in the model.

For example, the train adj has 1141 rows, where each row corresponds to one training data point. However, the rows have different lengths, and every entry is either zero or one. Could you explain how the adj matrix is stored here, or perhaps add a dataset description file to the repo?
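To make the observation concrete, here is a minimal sketch of the kind of check that shows the ragged rows. The `adj_train_toy` list is a hypothetical stand-in with made-up values; it is not the real SIDER file, and the actual loading code and storage format in the repo may differ.

```python
import numpy as np

# Hypothetical stand-in for the loaded train adj (made-up values, NOT real SIDER data).
# Each entry plays the role of one row, i.e. one training data point.
adj_train_toy = [
    np.array([0, 1, 0, 1]),
    np.array([1, 0]),
    np.array([0, 0, 1, 0, 1, 1]),
]

# Length of every row; a regular N x N adjacency matrix would give one repeated value.
row_lengths = [len(row) for row in adj_train_toy]

# True if the rows do not all share the same length.
is_ragged = len(set(row_lengths)) > 1

# True if every entry in every row is 0 or 1.
is_binary = all(np.isin(row, [0, 1]).all() for row in adj_train_toy)

print("row lengths:", row_lengths)
print("ragged:", is_ragged, "| binary:", is_binary)
```

On the real data, a check like this is what reports 1141 rows of differing lengths containing only zeros and ones.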

Also, how is the adj matrix split? In the classification task, where the features from the valid and test data are used to generate the representations of the training data, adj_train should be asymmetric and directed.
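A toy sketch of why I would expect asymmetry, assuming a hypothetical 5-node graph in which nodes 0 to 2 are training nodes and nodes 3 to 4 are valid/test nodes. The graph, the index split, and the variable names are illustrative assumptions, not the repo's actual split.

```python
import numpy as np

# Hypothetical symmetric adjacency over all 5 nodes (made-up edges).
full = np.array([
    [0, 1, 0, 1, 0],
    [1, 0, 1, 0, 0],
    [0, 1, 0, 0, 1],
    [1, 0, 0, 0, 1],
    [0, 0, 1, 1, 0],
])

train_idx = np.array([0, 1, 2])  # assumed training nodes

# Rows for training nodes, columns for ALL nodes: training nodes can
# aggregate features from valid/test neighbours.
adj_train_block = full[train_idx, :]

# The same information embedded in a square matrix: only the rows of
# training nodes are kept, so messages flow into training nodes only.
directed = np.zeros_like(full)
directed[train_idx, :] = full[train_idx, :]

print("block shape:", adj_train_block.shape)
print("full symmetric:", np.array_equal(full, full.T))
print("directed symmetric:", np.array_equal(directed, directed.T))
```

In this sketch the train block is rectangular, and its square embedding is no longer symmetric even though the full graph is. That is the structure I expected to find in adj_train.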

Thanks for your help in advance!

Tianqi-py, Nov 25 '22 16:11