Yubin Wang

Results 59 comments of Yubin Wang

same problem here。

I have install mxnet by pip, the problem also occurs. System: Windows 7 Python: 2.7.13 miniconda nightly build version

@yajiedesign here are more details: ```shell python -dv hello-mxnet.py (import mxnet as mx) ``` ```shell # cleanup[2] stat # cleanup[2] cv2.multicalib # cleanup[2] numpy.lib.format # cleanup[2] numpy.testing.nosetester # cleanup[2] cryptography.hazmat.primitives.asymmetric.rsa...

yes, i have found that, the cudnn depthwise's weights shape mismatch with other methods. when using checkpoint to initialize(such as resume train, or test accuracy), mismatch shape error would occur

@bruinxiong thanks. I am also doing some experiments on resnet-18 with batch size 256. 256 is worse than 512. and there is no theoretical relationship between batchsize and dataset size,...

@bruinxiong, I have trained a renset-18 using this code with batch-size 256, after 4 days training, the result is a little better(30.4% error rate) than 512 batchsize. You may try...

@bruinxiong lr decreases at 30, 60, 90, and at 98 remove the imgressive data augmentation.

@LiJunY `get_model_tokenizer` 函数里指定你训练后的位置给`model_id_or_path` 参数