Przemyslaw Tredak comments

Results 142 comments of


                                            Przemyslaw Tredak

Shuffle in groups for IndexedRecordIO

I don't think it would make big difference (and user is free to change it to lower value if they want) - the indexed recordio version of shuffle is much...

Shuffle in groups for IndexedRecordIO

@piiswrong @zhreshold Do you have any other comments?

The implementation of ResNet is different from official implementation in Caffe

This is only partially true (and the issue should not be closed). Downsample is one of the convolutions that should have stride 2 (and it has, like you pointed out,...

Using mxnet on RTX3090

Alternatively you can use the NGC container: https://ngc.nvidia.com/catalog/containers/nvidia:mxnet , version 20.10 supports `sm_86` (so RTX3000 series).

[BUGFIX] Fix nms kernel's out of range access issue

This is a legitimate failure - we are using C++17 which does not need message in static_assert, but the previous versions do (and 1.x uses older C++ standard) - you...

Better handling of the engine destruction

Could you try CUDA 11.something? There was a change in 11.2 I believe that should help here.

Feature request: Add Llama-style MLP with three linear layers

Hi @rationalism, Llama is actually supported by TE's LayerNormMLP module via the `swiglu` activation. For performance reasons we fuse the 2 Linear layers into a single one. I recommend looking...

Feature request: Add Llama-style MLP with three linear layers

Closing this issue since GLU activations are supported in TE and there was no activity here for over a month. Please feel free to reopen if you believe that we...

TransformerEngine v1.2.1 throws CuDNN frontend error on H100 GPU (AWS p5.48xlarge instance)

Hi @sirutBuasai, what is the cuDNN version you are using?

TransformerEngine v1.2.1 throws CuDNN frontend error on H100 GPU (AWS p5.48xlarge instance)

@cyanguwa I think we still should catch this error from cuDNN Frontend and just disable cuDNN's implementation of attention in this case.