123 comments of Xin Yao

> @yaox12 Can you add cuda versions?

Added.

I don't think this is an issue related to data alignment, because in this GAT model the node features are projected from 602- to 256-dimensional tensors before invoking GSpMM. We...
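To make the dimension claim concrete, here is a minimal stand-in in plain PyTorch (not DGL's actual GSpMM kernel; the 602/256 sizes come from the comment above, while the node count, `nn.Linear` projection, and identity adjacency are hypothetical):

```python
import torch
import torch.nn as nn

# Hypothetical sketch: node features are projected 602 -> 256 by a
# linear layer BEFORE the sparse aggregation (GSpMM-like step), so the
# tensors entering the sparse matmul are already 256-wide.
proj = nn.Linear(602, 256)
feats = torch.randn(1000, 602)        # 1000 nodes, raw 602-dim features
h = proj(feats)                       # projected to 256 dims
adj = torch.eye(1000).to_sparse()     # stand-in for the graph adjacency
agg = torch.sparse.mm(adj, h)         # aggregation over neighbors
print(agg.shape)                      # torch.Size([1000, 256])
```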

> Thanks for the suggestion. Just curious: does PyTorch support bfloat16 natively?

Yes. PyTorch supports bfloat16 natively on both CPU and GPU.

bfloat16 requires compute capability >= 8.0 and CUDA >= 11.
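A small sketch of how one might gate on that requirement at runtime (the `pick_dtype` helper is hypothetical; `torch.cuda.is_bf16_supported()` is the relevant PyTorch check):

```python
import torch

# Hedged sketch: choose bf16 only when the GPU advertises support.
# torch.cuda.is_bf16_supported() accounts for the compute capability
# (>= 8.0) and the CUDA build; anything else falls back to fp32 here.
def pick_dtype():
    if torch.cuda.is_available() and torch.cuda.is_bf16_supported():
        return torch.bfloat16
    return torch.float32  # conservative fallback for older GPUs/CUDA

dtype = pick_dtype()
print(dtype)
```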

Do we need fallbacks for `__CUDA_ARCH__ < 800`? cc @nv-dlasalle

For PyTorch:
1. bf16 arithmetic functions are supported on all CUDA architectures. For example, the following code is valid. ```python...
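The snippet in the comment above is truncated, so here is a hedged sketch of the kind of bf16 arithmetic meant (the specific tensors and ops are illustrative; the claim being demonstrated is that PyTorch accepts these ops regardless of GPU generation, converting through float32 internally where native bf16 units are missing):

```python
import torch

# Illustrative bf16 arithmetic; runs on CPU or any CUDA device.
device = "cuda" if torch.cuda.is_available() else "cpu"
a = torch.tensor([1.0, 2.0, 4.0], dtype=torch.bfloat16, device=device)
b = torch.tensor([0.5, 0.25, 0.125], dtype=torch.bfloat16, device=device)
out = a * b + 1.0     # elementwise multiply and add, all in bf16
print(out.dtype)      # torch.bfloat16
print(out.cpu().tolist())
```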

@BarclayII Cannot reproduce with the GraphSAGE example and DGL 0.9.0. Multi-worker CPU sampling combined with a CUDA dataloader device should be covered by the unit test now. https://github.com/dmlc/dgl/blob/5ba5106acab6a642e9b790e5331ee519112a5623/tests/pytorch/test_dataloader.py#L185-L187 @samvanstroud Are you...
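For readers unfamiliar with the pattern being tested, an illustrative stand-in in plain PyTorch (not DGL's `DataLoader`; dataset contents and batch size are made up): CPU worker processes produce batches, which are then moved to the CUDA device inside the loop.

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Multi-worker CPU loading feeding a (possibly CUDA) device.
dataset = TensorDataset(torch.arange(32, dtype=torch.float32))
loader = DataLoader(dataset, batch_size=8, num_workers=2)
device = "cuda" if torch.cuda.is_available() else "cpu"
total = 0.0
for (batch,) in loader:
    total += batch.to(device).sum().item()
print(total)  # 496.0, the sum of 0..31
```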

> @yaox12 Do you know why pytorch 1.12.1 would cause this?

This looks like an issue with a forked CUDA context, not related to the TensorAdapter (which has issues when...

@mufeili I can reproduce this issue with PyTorch 1.12.1, but haven't found the root cause. Judging from the error message, it doesn't seem related to the TensorAdapter, so I'm not sure...

@nv-dlasalle Good catch! This is my fault. Should be fixed in #4450.

Can you provide env information such as:

- DGL Version (e.g., 1.0):
- Backend Library & Version (e.g., PyTorch 0.4.1, MXNet/Gluon 1.3):
- OS (e.g., Linux):
- How you installed...
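A small helper one could run to gather those details (this is a hypothetical convenience script, not an official DGL tool; the `torch` and `dgl` imports are optional so it still runs when either is missing):

```python
import platform
import sys

# Collect environment details useful for a bug report.
info = {"python": sys.version.split()[0], "os": platform.platform()}
for name in ("torch", "dgl"):
    try:
        mod = __import__(name)
        info[name] = mod.__version__
    except ImportError:
        pass  # library not installed; report what we have
print(info)
```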