neural-subgraph-learning-GNN icon indicating copy to clipboard operation
neural-subgraph-learning-GNN copied to clipboard

Ratio of Train set and Test set size

Open abhishekrajgaria opened this issue 4 years ago • 1 comments

As written in common/data.py in load_dataset the ratio of train_set : test_set is 80 : 20,

but as we randomly generate positive and negative query-target pair, (balance case) we are getting 4096 graphs, for both training set and testing set, (imbalance case) we are getting 2048 graphs for both training set and testing set,

So won't that be an issue as the size of train_set is same as test_set?

abhishekrajgaria avatar Oct 27 '20 12:10 abhishekrajgaria

Can't confirm right now but I believe there is an option --val_size to change the validation set size. Agree val_size should be increased during a rigorous evaluation

On Oct 27, 2020, at 05:23, Abhishek Rajgaria [email protected] wrote:

 As written in common/data.py in load_dataset the ratio of train_set : test_set is 80 : 20,

but as we randomly generate positive and negative query-target pair, (balance case) we are getting 4096 graphs, for both training set and testing set, (imbalance case) we are getting 2048 graphs for both training set and testing set,

So won't that be an issue as the size of train_set is same as test_set?

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub, or unsubscribe.

qema avatar Oct 27 '20 20:10 qema