xiang song(charlie.song) issues

Results 32 issues of


                                            xiang song(charlie.song)

[Documentation] Add doc about saving configuration files in gconstruct and gsprocessing.

Add doc about saving configuration files when processing graph input data (e.g. --output-conf-file). The saved configuration file will record how data are transformed (numerical features, categorical features, etc.). For example:...

0.3

[Feature Request] Support Multi-task learning

GraphStorm should support multi-task learning as its built-in capability. * Unsupervised multi-task learning: Other than using link prediction as the sole supervision signal for unsupervised learning, one can add other...

Using numpy.array to store string object is not memory efficient.

https://github.com/awslabs/graphstorm/blob/main/python/graphstorm/gconstruct/file_io.py#L153 The default behavior of numpy.array(string) will allocate too much memory than required, which is not efficient.

efficiency bug

Optimize GNN-Bert distillation user experience.

We need to support following user experience: - [ ] #513 - [ ] Resuming a distillation task from a saved checkpoint - [ ] Provide an end2end experience of...

[Bug] When using batch norm GraphStorm may fail

GraphStorm's edge sampler does not guarantee that for each edge type in training set it will sample more than 1 edges. This will cause an error in when batch-norm is...

bug

Refactor the Link prediction decoder and loss function design.

Currently, the implementation of computing losses for link prediction training (in GSgnnLinkPredictionModel.forward) is not friendly to loss functions like contrastive loss and triple loss which requires that a positive edge...

enhancement

[Draft] HAT for Graph with rich text data

*Issue #, if available:* *Description of changes:* By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

draft

0.3

xiang song(charlie.song)