Da Zheng
The code at https://gist.github.com/zheng-da/ca33af9048d09553f668f95bfd13cb0e shows how to calculate the memory requirement for GNN training in DistDGL.
Decouple the number of processes from the number of partitions: with 10 processes, the number of partitions is 10 * k.
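To make the 10 vs. 10 * k relationship concrete, here is a minimal sketch of one way partitions could be mapped to processes. The function name `assign_partitions` and the contiguous-block layout are assumptions for illustration only, not how DistDGL actually assigns partitions.

```python
from typing import Dict, List


def assign_partitions(num_procs: int, k: int) -> Dict[int, List[int]]:
    """Map each process rank to the k partition IDs it owns.

    Hypothetical helper: assumes num_procs * k partitions in total and
    hands each process a contiguous block of k partition IDs.
    """
    if num_procs <= 0 or k <= 0:
        raise ValueError("num_procs and k must be positive")
    return {
        rank: list(range(rank * k, (rank + 1) * k))
        for rank in range(num_procs)
    }


# 10 processes, k = 3, so 30 partitions in total.
assignment = assign_partitions(num_procs=10, k=3)
total_partitions = sum(len(parts) for parts in assignment.values())
```

With this layout, changing k changes the partition count without touching the number of processes.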
I have itemized the work. I think we only need to fix the 4 items I listed above.
If the edge classification task is to predict the existence of an edge between two nodes, we should remove the target edge from message passing to avoid information leakage. if...
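A rough sketch of the idea, in plain Python rather than any GraphStorm or DGL API: before message passing, drop the edges being predicted. The helper name `message_passing_edges` is hypothetical, and also dropping the reverse edge is an assumption here, not something stated above.

```python
from typing import List, Tuple

Edge = Tuple[int, int]


def message_passing_edges(
    edges: List[Edge],
    target_edges: List[Edge],
    exclude_reverse: bool = True,
) -> List[Edge]:
    """Return the edges that remain usable for message passing.

    Hypothetical helper: removes every target edge (and, optionally, its
    reverse) so the model cannot see the edge it is asked to predict.
    """
    blocked = set(target_edges)
    if exclude_reverse:
        # Assumption: the reverse edge leaks the same information.
        blocked |= {(dst, src) for (src, dst) in target_edges}
    return [edge for edge in edges if edge not in blocked]


edges = [(0, 1), (1, 0), (1, 2), (2, 3), (3, 0)]
target = [(0, 1)]  # the edge whose existence is being predicted

mp_edges = message_passing_edges(edges, target)
mp_edges_keep_reverse = message_passing_edges(edges, target, exclude_reverse=False)
```

Whether the reverse edge should also be excluded depends on how the graph was constructed, so that flag is left to the caller.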
https://github.com/awslabs/graphstorm/pull/714
can you try `num_samplers=0`?
is it an out-of-memory error?
is this still needed?