Results 123 comments of Xin Yao

Close by https://github.com/dmlc/dgl/pull/4965.

@bkm-clem It still exits unexpectedly after I built it myself. Could you kindly share the binary file that works normally on macOS 11?

We're aware of this bug and will push a fix to MCore. For now, you can add the following code in https://github.com/NVIDIA/Megatron-LM/blob/89574689447d694bb19dd86fc8a6153b4467ba9d/megatron/core/transformer/custom_layers/transformer_engine.py#L464 ```python # In PyTorch, the following two tensors...

A `GroupedLinear` layer has been added in TE v1.9, and it has FP8 support.

Thanks. I'll include your commit in my internal fix PR.

Your commit is squash merged in https://github.com/NVIDIA/Megatron-LM/commit/2488e20d488b35259f6eb53267e8a0525a373599 and your author information is preserved. Thanks for your contribution. You can close this PR now.

Fixed in https://github.com/NVIDIA/Megatron-LM/commit/98b43c91d004dec254f1610d9cffae8aff8550f3.

Good catch. Fixed in https://github.com/NVIDIA/Megatron-LM/commit/e223b920724c523a098558b2c128b4d6eb7f8ff0.