ShuffleNet icon indicating copy to clipboard operation
ShuffleNet copied to clipboard

pre-train model

Open linchaozhang opened this issue 8 years ago • 70 comments

Have you successfully trained the model? can you sent me a pre-train model, i want to train a shufflenet-ssd with a pre-train model.

thanks!

linchaozhang avatar Jul 17 '17 06:07 linchaozhang

@linchaozhang The model is still training right now.

farmingyard avatar Jul 17 '17 08:07 farmingyard

@farmingyard what is your current accuracy of val?

KeyKy avatar Jul 18 '17 04:07 KeyKy

Hi @farmingyard , I'm sort of confused about your stride case in your deploy prototxt. Why you don't shuffle the channel here? thanks!

leochli avatar Jul 18 '17 09:07 leochli

@leolee96 Do you mean the stride=2 case? Yeah, its my mistake, i have fixed it. Thanks.

farmingyard avatar Jul 18 '17 13:07 farmingyard

+1 for pre trained model @farmingyard thank you for your share :) is it possible for you to share solver file also ?

ananddb90 avatar Jul 18 '17 13:07 ananddb90

Linear lr seems to work well

suzhenghang avatar Jul 19 '17 08:07 suzhenghang

@linchaozhang @KeyKy @leolee96 @ananddb90 The pre-trained model is released, please check ReadMe file.

farmingyard avatar Jul 25 '17 14:07 farmingyard

THX!

KeyKy avatar Jul 26 '17 01:07 KeyKy

@farmingyard Thanks a lot. But i found some differences between prototxt and caffemodel, in 'resx5_conv1', num_output : 60 in your prototxt, and 240 in your model. i think it should be 60 for 'resx5_conv1'(because the number of bottleneck channels to 1/4 of the output channels for each ShuffleNet unit, said in paper)

linchaozhang avatar Jul 26 '17 07:07 linchaozhang

@linchaozhang I think you may had a mistake,cause the model is trained with the deploy prototxt file. BTW,how can you get the model's layer output?

farmingyard avatar Jul 26 '17 10:07 farmingyard

@farmingyard I have found the real difference, missing the param "group" in resx5_conv1 and resx13_conv1 in your prototxt, and missing the shuffle_layer, i think you forget to update the prototxt.

linchaozhang avatar Jul 27 '17 02:07 linchaozhang

@linchaozhang Sorry for that I updated the deploy file 9 days ago, you may not follow it. Please check the update info. https://github.com/farmingyard/ShuffleNet/commit/fdb6e836072581cd465de2d9d07bc6571be86fca

farmingyard avatar Jul 27 '17 03:07 farmingyard

i use your model and deploy.prototxt for imagenet classification, but result seems to be totally wrong(result labels are all wrong), i compare it to caffenet model、deploy.prototxt(result labels are right)。

123chengbo avatar Jul 27 '17 08:07 123chengbo

Hi, @farmingyard Did you do any data augmentation in training stage?

chenxia-han avatar Aug 01 '17 03:08 chenxia-han

@xchani Just crop and mirror.

farmingyard avatar Aug 02 '17 04:08 farmingyard

@farmingyard Hi, thanks for your sharing, great job. And i just wonder where is your pre-trained model, you have mentioned that have released, but i just can not see anything~~I hope i have not missed anything

ghost avatar Aug 07 '17 06:08 ghost

@farmingyard Hi, thanks for sharing your great job. And I just wonder where is your pre-trained model that you have mentioned above, but i just can not find it after having checked ReadMe file

yunren avatar Aug 08 '17 07:08 yunren

@fujihuang @yunren Sorry for some reasons that I can't release the pre-trained model any more.

farmingyard avatar Aug 08 '17 14:08 farmingyard

any one can send the pre-trained model to me ? i really need it, thanks for your help @linchaozhang @farmingyard @KeyKy @leolee96 @ananddb90

CrazySssst avatar Aug 23 '17 08:08 CrazySssst

can you send me your mail-id

ananddb90 avatar Aug 23 '17 11:08 ananddb90

Hi, would u put up the val accuracy of your model, please?

yonghenglh6 avatar Aug 24 '17 06:08 yonghenglh6

Hi,ananddb90 can you send the pre-trained model to me? I want finetune it to VOC dataset.

dingdashen avatar Aug 24 '17 15:08 dingdashen

@ananddb90 this is my email : [email protected] , thank you !

CrazySssst avatar Aug 25 '17 05:08 CrazySssst

any one can send the pre-trained model to me? I hope to finetune it on faster-rcnn. @farmingyard @linchaozhang @KeyKy @leolee96 @ananddb90 . my email [email protected] thanks

acdreamdpl avatar Aug 25 '17 07:08 acdreamdpl

sorry for late reply . https://bitbucket.org/ananddb90/shuffle_net/src

could you please share your experience after fine tunning on detection problem as I am also trying the same and yet not able to get good results

ananddb90 avatar Aug 25 '17 08:08 ananddb90

thanks for you sharing. i will share my results on SSD after finish my experiment

CrazySssst avatar Aug 25 '17 09:08 CrazySssst

@ananddb90 Thanks for your sharing. Can you put up the accuracy of your sharing ShuffleNet on ImageNet?

7oud avatar Aug 28 '17 07:08 7oud

@farmingyard @ananddb90 @linchaozhang @KeyKy @leolee96 Do you have 0.5_shufflenet and 0.75_shufflenet caffe pre-trained model,train.prototxt? thank you !

zoufangyu1987 avatar Aug 29 '17 10:08 zoufangyu1987

@farmingyard @ananddb90 @linchaozhang @KeyKy @leolee96 I need to run ShuffleNet on "ARM" or "DSP", so that speed is the most important. Do you have any suggestion? Thank you very much !

zoufangyu1987 avatar Aug 29 '17 10:08 zoufangyu1987

@123chengbo I have the same problem with you , do you find the cause of this problem?Looking forward to your reply!

felixfuu avatar Aug 30 '17 02:08 felixfuu