Lai Wei comments

Results 9 comments of


                                            Lai Wei

fix dataloder len

@tjruwase Thanks for the review! will add test and reformat later today.

fix dataloder len

@tjruwase Sorry I totally forgot about this. I merged some tests from @junxu with test (Thanks to @junxu !!) It should be good now.

Is it possible to convert a GPU pre-trained model to CPU without cudnn?

cudnn.TemporalConvolution is also not converted, since it's a wrapper on cudnn.SpatialConvolution.

cudnn.convert does not convert nn.TemporalConvolution

Hi, any ideas how to convert TemporalConvolution from nn to cudnn ?

feature: Add Native Pytorch DDP Support

FYI torch distributed launch is deprecated, PyTorch suggest to use torchrun https://pytorch.org/docs/stable/distributed.html#launch-utility https://pytorch.org/docs/stable/elastic/run.html#launcher-api

An empty NDArray should have size 0

@ziyuang please trigger CI and update this PR?

ResNetV1.java Network error

FYI the Gluon implementation is using `stride` for first Conv2D and `1` for the second. Which is different from the Symbol implementation: https://github.com/apache/incubator-mxnet/blob/master/python/mxnet/gluon/model_zoo/vision/resnet.py#L90

Add C++ Predictor class for inference

@ddavydenko ping to help move forward, let's get this merged. Thanks!

PyTorch 2.2.0 NVFuser deprecation is incompatible with TransformerEngine.

Hi team, any plan to fix this? without transformer engine working it's hard to justify the price for H100s.