pygcn icon indicating copy to clipboard operation
pygcn copied to clipboard

I am confused about the dataset

Open liiiiiiiiil opened this issue 5 years ago • 2 comments

In the cora.content file, I don't know what is the features mean.

liiiiiiiiil avatar Jan 05 '19 03:01 liiiiiiiiil

The .content file is the features of every node(paper). The first column is paper_id, and the last column means its ground-truth label. The columns in the middle represent the feature of this paper. Every dim of the feature here is 0 or 1. Maybe, dim1 means 'containing the word: neural network', dim2 means ..., dim3 means..

Kobeyond avatar Aug 21 '19 06:08 Kobeyond

Maybe you can refer to the paper which proposed dataset 'cora', for more details.

Kobeyond avatar Aug 21 '19 06:08 Kobeyond