relational-gcn
relational-gcn copied to clipboard
Validation loss goes to infinity
I have a large graph with 7000 nodes and 20 million triples with 110 class types and 59 relation types. Somewhere along the training, the validation loss suddenly becomes infinite. We also tried this on a much smaller graph with 7000 nodes and 117k triples with 110 class types and 10 relation types. The validation loss somehow kept increasing instead of decreasing until it went to infinity suddenly. Any insights into what might be causing it and how to debug? Is it just an issue of the graph size or could something else be wrong?
I have not experienced this myself with this code release, have you maybe made some changes to the implementation?
Maybe it is also worth trying this PyTorch implementation: https://docs.dgl.ai/tutorials/models/1_gnn/4_rgcn.html
the infinity maybe related to the dataself