powerful-gnns icon indicating copy to clipboard operation
powerful-gnns copied to clipboard

Custom dataset creation

Open acecreamu opened this issue 5 years ago • 4 comments

Hi, thank you for such an impressive work. I would like to apply your algorithm to my task, so I need to create dataset which will fit your code. I have, say, 150 graphs of 200 nodes each, where all the nodes are equal. I'm trying to understand your txt files, but have some issues with that. For example:

10 0
0 3 1 2 9
0 3 0 2 9
0 4 0 1 3 9
0 3 2 4 5
0 3 3 5 6
0 5 3 4 6 7 8
0 4 4 5 7 8
0 3 5 6 8
0 3 5 6 7
1 3 0 1 2

My assumption is that the block correspond to a graph, 10 is a number of nodes while 0 is a graph class label. Each row correspond to a node, where first value is a (node label I guess?), second is a number of links, and other are connections. Is that right? Is row number correspond to the node index?

acecreamu avatar Apr 21 '19 10:04 acecreamu

I reviewed the code, and I have a same opinion as you.

kimsu55 avatar Apr 13 '20 11:04 kimsu55

Hi! Thanks for your interest. GIN has been implemented by major libraries like Pytorch Geometric and DGL. The implementation is highly optimized and faster. I recommend using those libraries for your purpose. Those libraries also provide the same set of datasets as our work, so you should be able to easily reproduce our work using the libraries.

weihua916 avatar Apr 13 '20 11:04 weihua916

@weihua916 Can you clarify the format for the files? I'm also wondering about this.

mdanb avatar Mar 26 '21 16:03 mdanb

We directly adopted the dataset files from https://github.com/muhanzhang/pytorch_DGCNN/tree/master/data

weihua916 avatar Apr 22 '21 03:04 weihua916