deep-residual-networks icon indicating copy to clipboard operation
deep-residual-networks copied to clipboard

train_val.prototxt and solver.prototxt for Resnet-152 layer

Open ghost opened this issue 9 years ago • 11 comments
trafficstars

Hi All, Past few weeks i have been trying to Train ResNet-152 on BVLC caffe with Imagenet 2012 dataset. But my accuracy or learning remains flat all the time, can some one share the train_val.prototxt and solver.prototxt. i tried pynetbuilder but some how couldn't crack it. If someone is successful can some one share these prototxt files

ghost avatar Nov 16 '16 11:11 ghost

solver.prototxt.txt train_val1_xavier_init_prototxt.txt train_val2_use_global_stats_flase_prototxt.txt

Following are my prototxt whcih i tried... where the accuracy is flat after few epoch of Training on 4 GPU

ghost avatar Nov 16 '16 11:11 ghost

You can try to change weight_filler from xavier to msra.

acgtyrant avatar Nov 23 '16 09:11 acgtyrant

@tyrosathish I am facing the similar problem, I tried similar approach but the loss rate is just flat.

ps48 avatar Dec 02 '16 08:12 ps48

@tyrosathish can you share me your resnet-152.caffemodel?i can't parse the caffemodel which i download from the link.

Check failed: ReadProtoFromBinaryFile(param_file, param) Failed to parse NetParameter file: ResNet-152-model.caffemodel

mathfinder avatar Dec 05 '16 01:12 mathfinder

Hi guys,

I am facing the same problem too, my loss is almost around 6.9 and can not decrease. Anybody have fixed this problem?

codegank avatar Dec 05 '16 21:12 codegank

.prototxt I try to re-implement ResNet-18 on ILSVRC12, but my accuracy is about 86.66%@Top-5. Baseline is about 90%+ @Torch.

HolmesShuan avatar Dec 21 '16 10:12 HolmesShuan

@HolmesShuan Why are you using xavier filler for the Fully-Connected layer and msra filler for all the convolutional layers. I once did not use a weight filler for the Fully-Connected layer and retrieved better results. Can you explain that?

thigi avatar Apr 24 '17 17:04 thigi

@thigi I find the unofficial prototxt here. To be honest, I haven't studied it yet. Sorry. :(

HolmesShuan avatar Apr 25 '17 00:04 HolmesShuan

Hi! @thigi. I just saw those prototxt files there and got confused as the guy has used type:Data instead type:MemoryData in the TEST layer. Please enlighten me on 'if this change will or will not effect the performance of ResNet. Thanks in advance!

whitewalker007 avatar Jun 29 '17 08:06 whitewalker007

met the same problem too

shichaosuper avatar Dec 26 '17 08:12 shichaosuper

same problem too.. anyone get the answer?

zangkaiqiang avatar Jan 31 '18 07:01 zangkaiqiang