ScratchDet icon indicating copy to clipboard operation
ScratchDet copied to clipboard

Add code and models

Open impredicative opened this issue 6 years ago • 14 comments

This issue merely serves as a tracker. Please update it when the code and models are added for https://arxiv.org/abs/1810.08425

impredicative avatar Oct 23 '18 10:10 impredicative

hi @KimSoybean do you still have the plan to release the code and model? thanks.

zimenglan-sysu-512 avatar Oct 25 '18 08:10 zimenglan-sysu-512

@KimSoybean in your paper, batchsize=128 means 128 imgs per gpu or total batch size on 4 gpu?

hust-kevin avatar Feb 27 '19 02:02 hust-kevin

@zimenglan-sysu-512 Hi, we will release the codes and models. But we will release them after the announcement of CVPR2019. I recommend you to reproduce the code on mmdetection, which is much faster than SSD-caffe. (Our method is so easy)

KimSoybean avatar Feb 27 '19 02:02 KimSoybean

@hust-kevin Hi, the accumulate batchsize is 128, in our code for Root-ResNet34, we put 16 images on each GPU (totally 4 GPUs), and the gradient is updated by the average of double iterations. You can see the source code in SSD-caffe.

KimSoybean avatar Feb 27 '19 02:02 KimSoybean

@KimSoybean thanks, I will try it on mmdetection, I have only 1 GPU. if I set 16 images per GPU,should I change lr to 0.05/8, what's more, how to updated gradient by the average of double iterations in pytorch?

hust-kevin avatar Feb 27 '19 07:02 hust-kevin

@hust-kevin I think lr=0.03 is better, but it depends on your loss curves. If you have only one GPU, I recommend you to use GN instead of BN and train more epochs (I set totally 44 epochs on ssd of mmdetection ).

KimSoybean avatar Feb 27 '19 08:02 KimSoybean

thanks

hust-kevin avatar Feb 27 '19 10:02 hust-kevin

Do you still have the plan to release the code and model? thanks.

dby2017 avatar Mar 18 '19 04:03 dby2017

@dby2017 Recently we have trained the 512x512 models on mmdetection which took a long time. We will release codes and models when we finish it.

KimSoybean avatar Mar 20 '19 04:03 KimSoybean

Can the idea generalise to other base networks? E.g., resnet 18 layers. I tried the paper's idea by adding BN after each conv layer, and train from scratch with lr=0.01 and lr=0.05, however I'm getting worse results.

liuchang8am avatar Mar 27 '19 02:03 liuchang8am

@KimSoybean where can i get the pretrained model VGG-16-BN ?

pyupcgithub avatar Apr 03 '19 03:04 pyupcgithub

any updates?

liuchang8am avatar May 12 '19 09:05 liuchang8am

Do you still have the plan to release the code and model? thanks.

E-Dreamer-LQ avatar May 29 '19 12:05 E-Dreamer-LQ

@Forgetmyfairy I have just finished my graduation thesis for undergraduates. This weekend I will upload them.

KimSoybean avatar May 29 '19 14:05 KimSoybean