gnn-benchmark

The corresponding class names for the "Coauthor CS" dataset

Open JialongWang1224 opened this issue 4 years ago • 2 comments

Hello, author. First of all, thank you very much for your work! When we use the "Coauthor CS" dataset, we get class labels from 0 to 9. Your paper "Pitfalls of Graph Neural Network Evaluation" says "and class labels indicate most active fields of study for each author" (page 7). I now need the true class names, i.e. the real "active fields", not the numbers. Could you give some help? Thank you very much. 😊

JialongWang1224, Sep 08 '20 10:09

Hello, author,

I am also curious to know the true class names of the Amazon dataset. Is there a way to get this information? Thanks so much!

MengLiuPurdue, Aug 13 '21 19:08

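You can get the original class names as follows:

```python
import numpy as np

# Load the .npz archive into a plain dict; allow_pickle is needed for object arrays.
d = dict(np.load("data/npz/amazon_electronics_computers.npz", allow_pickle=True))
# Entry i is the name of the class with integer label i.
print(d["class_names"])
```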

Replace amazon_electronics_computers with the name of the dataset that you are interested in (see this folder for the list).
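To go one step further and turn the integer labels into readable names, something like the following sketch may help. It assumes that the archive also stores the per-node labels as a dense integer array under a `labels` key, and that label `i` corresponds to `class_names[i]`; both are assumptions, so please check `d.keys()` for your dataset first. The helper names and the toy data are made up for illustration.

```python
from collections import Counter


def load_npz_as_dict(path):
    """Load an .npz archive into a plain dict, as in the snippet above."""
    import numpy as np  # imported lazily so the helpers below run without NumPy
    return dict(np.load(path, allow_pickle=True))


def labels_to_names(labels, class_names):
    """Map integer class labels to names, assuming label i -> class_names[i]."""
    names = [str(name) for name in class_names]
    return [names[int(label)] for label in labels]


def class_counts(labels, class_names):
    """Count how many nodes carry each class name."""
    return Counter(labels_to_names(labels, class_names))


# Toy data standing in for d["labels"] and d["class_names"] (invented values).
toy_class_names = ["Field A", "Field B", "Field C"]
toy_labels = [2, 0, 0, 1, 2, 2]

print(labels_to_names(toy_labels, toy_class_names))
print(class_counts(toy_labels, toy_class_names))

# With a real file (assuming a dense "labels" key exists in the archive):
# d = load_npz_as_dict("data/npz/amazon_electronics_computers.npz")
# print(class_counts(d["labels"], d["class_names"]))
```

If a dataset stores its labels in a different form (for example as a sparse matrix), the lookup has to be adapted accordingly.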

@JialongWang1224 I'm really sorry I didn't notice your original question.

shchur, Aug 14 '21 07:08