planetoid icon indicating copy to clipboard operation
planetoid copied to clipboard

original datasets ?

Open Chunpai opened this issue 6 years ago • 10 comments

Hello, could you please provide the original dataset before your preprocessing ? It seems it does not match the dataset in the following link https://linqs.soe.ucsc.edu/data. It seems your train/val/test splits are well chosen. Some labels are also different from the original dataset in https://linqs.soe.ucsc.edu/data. Do you have any ideas about this ? Thanks.

Chunpai avatar Jun 03 '18 05:06 Chunpai

Hi, I have the same question too. Could you please give me some advice?

ghost avatar Jul 03 '18 03:07 ghost

No. If you are working on something related to GCN, you probably need to just use the random splits version.

Chunpai avatar Jul 03 '18 15:07 Chunpai

Hi, I am working on this but confused on dataset. There is no clear explanation how to convert original dataset for gcn. Can you provide raw data preprocessing code files? or instructions how to do that?

Thank you !

monk1337 avatar Oct 10 '18 08:10 monk1337

Hi, I have the same question too. Could you please provide me raw data preprocessing code files?

ghost avatar Oct 25 '18 10:10 ghost

Hey, anybody has solved this problem. Could you share your code with me? Many thanks...

daiquanyu avatar Nov 22 '18 07:11 daiquanyu

Hi ,I try to write the code to create the dateset like yours but seems somting wrong. I use the dataset X = x_train+x_val+x_test (L =L_train+L_val+L_test ) to create the graph(X.shape[0]*X.shape[0]) am I wrong? do you have any idea about this problem,if so could you tell me,thank you very much!

Davidlihuang avatar May 13 '19 17:05 Davidlihuang

Hello, could you please provide the original dataset before your preprocessing ? It seems it does not match the dataset in the following link https://linqs.soe.ucsc.edu/data. It seems your train/val/test splits are well chosen. Some labels are also different from the original dataset in https://linqs.soe.ucsc.edu/data. Do you have any ideas about this ? Thanks.

@Chunpai I met the same issue when regenerating the data, cannot get such high performance by random selection. Have you solved your issue?

o0windseed0o avatar Jun 25 '19 19:06 o0windseed0o

Hi, I have the same question too. Could you please provide me raw data preprocessing code files?

readergy avatar Apr 20 '21 12:04 readergy

Hello, could you please provide the original dataset before your preprocessing ? It seems it does not match the dataset in the following link https://linqs.soe.ucsc.edu/data. It seems your train/val/test splits are well chosen. Some labels are also different from the original dataset in https://linqs.soe.ucsc.edu/data. Do you have any ideas about this ? Thanks.

@Chunpai I met the same issue when regenerating the data, cannot get such high performance by random selection. Have you solved your issue?

Excuse me, have you found the solution?

readergy avatar Apr 20 '21 12:04 readergy

Hi, I am working on this but confused on dataset. There is no clear explanation how to convert original dataset for gcn. Can you provide raw data preprocessing code files? or instructions how to do that?

Thank you !

The original dataset (http://www.cs.umd.edu/~sen/lbc-proj/LBC.html) is processed using Pickle (https://github.com/NIRVANALAN/gcn_analysis/blob/master/notebook/Plantenoid%20Citation%20Data%20Format%20Transformation.ipynb).

andrew-korea avatar Mar 29 '23 03:03 andrew-korea