sceneparsing

Failed to reproduce DilatedNet performance

Open DonghyunK opened this issue 7 years ago • 2 comments

Hi,

I am trying to reproduce DilatedNet.

However, my training results are: pixel acc 72.4%, mean acc 38.6%, mean IoU 28.7%.
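For readers comparing numbers, here is a minimal sketch of how these three metrics are commonly computed from a confusion matrix. It is not the repository's evaluation code: the function name `segmentation_metrics`, the `ignore_label` convention, and the choice to average only over classes that actually occur are all assumptions, and differences in exactly these conventions can shift the reported values.

```python
def segmentation_metrics(pred, gt, num_classes, ignore_label=-1):
    """Return (pixel_acc, mean_acc, mean_iou) from flat lists of class indices.

    Hypothetical helper for illustration; assumes at least one labeled pixel.
    """
    # conf[i][j] counts pixels whose true class is i and predicted class is j.
    conf = [[0] * num_classes for _ in range(num_classes)]
    for p, g in zip(pred, gt):
        if g == ignore_label:  # assumed convention for unlabeled pixels
            continue
        conf[g][p] += 1

    total = sum(sum(row) for row in conf)
    correct = sum(conf[c][c] for c in range(num_classes))
    pixel_acc = correct / total

    accs, ious = [], []
    for c in range(num_classes):
        tp = conf[c][c]
        gt_c = sum(conf[c])                                # pixels truly of class c
        pred_c = sum(conf[r][c] for r in range(num_classes))  # pixels predicted as c
        union = gt_c + pred_c - tp
        if gt_c > 0:      # skip classes absent from the ground truth
            accs.append(tp / gt_c)
        if union > 0:     # skip classes absent from both prediction and ground truth
            ious.append(tp / union)
    return pixel_acc, sum(accs) / len(accs), sum(ious) / len(ious)
```

Pixel accuracy is dominated by frequent classes, while mean accuracy and mean IoU weight every class equally, which is one reason the three numbers can sit far apart.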

Further training does not show improvement.

I am using a pre-trained net and multiple GPUs with a mini-batch size of 8. I did not use any augmentation, as the paper does not explain which augmentations were used. I expect that augmentation affects the results only slightly; otherwise you would probably have described it in the paper.

(1) Could you explain which augmentations were used and how much they improve the results?

(2) Could you provide training and validation log files?

Thank you so much.

DonghyunK · Apr 17 '17 20:04

Augmentation only helps a little (<2%); we only used flipping during training. Try initializing the model with a VGG network pretrained on ImageNet, and do not add layers like batch normalization.
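A minimal sketch of flip augmentation for segmentation follows. It assumes a horizontal (left-right) flip applied with probability 0.5, since the reply does not state the direction or the probability. The function name `random_hflip` is hypothetical, and plain nested lists stand in for image arrays.

```python
import random

def random_hflip(image, label, p=0.5, rng=random):
    """Flip an image and its label map left-right together with probability p.

    image, label: H x W lists of rows. Both must receive the same flip,
    otherwise pixels and their annotations no longer line up.
    """
    if rng.random() < p:
        image = [row[::-1] for row in image]
        label = [row[::-1] for row in label]
    return image, label

# Usage on a tiny 2 x 3 example (entries stand in for pixels / class ids).
img = [[1, 2, 3], [4, 5, 6]]
lab = [[0, 0, 1], [1, 1, 0]]
aug_img, aug_lab = random_hflip(img, lab)
```

The point of the sketch is that the image and the label map share one random draw; flipping them independently would corrupt the training targets.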

hangzhaomit · Apr 27 '17 18:04

@DonghyunK, can you comment on whether the above worked? Also, @hangzhaomit, what do you mean by initializing the model with a VGG pretrained on ImageNet? Is the DilatedNet just a standard VGG? Won't the difference in convolution type cause incompatibility?
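On the compatibility question, one relevant fact is that dilation changes only the spacing at which a kernel samples its input, not the kernel's shape, so weights trained for a standard convolution can be loaded into a dilated one of the same kernel size. The 1-D sketch below illustrates this under stated assumptions: `dilated_conv1d` is a hypothetical helper, and whether every layer of this repository's DilatedNet maps one-to-one onto VGG layers is not stated in this thread.

```python
def dilated_conv1d(signal, kernel, dilation=1):
    """'Valid' 1-D cross-correlation with kernel taps `dilation` samples apart."""
    span = (len(kernel) - 1) * dilation + 1  # input samples covered by one output
    return [
        sum(kernel[k] * signal[i + k * dilation] for k in range(len(kernel)))
        for i in range(len(signal) - span + 1)
    ]

kernel = [1.0, 0.0, -1.0]            # the same three weights in both calls
signal = [0, 1, 4, 9, 16, 25, 36]

standard = dilated_conv1d(signal, kernel, dilation=1)  # taps at i, i+1, i+2
dilated = dilated_conv1d(signal, kernel, dilation=2)   # taps at i, i+2, i+4
```

The weight list is identical in both calls; only the receptive field grows (3 samples versus 5), which is why the outputs differ in length and value even though no parameter changed.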

balloch · Jun 22 '17 16:06