CSRNet-pytorch icon indicating copy to clipboard operation
CSRNet-pytorch copied to clipboard

Why do you don't use all architecture pretrained model VGG16 ?

Open ThanhNhann opened this issue 6 years ago • 2 comments

I have read your paper and don't understand why you use the first ten layers of VGG-16 with only three pooling layers instead of all architecture pre-trained model VGG16 ? Thanks

ThanhNhann avatar Nov 21 '19 15:11 ThanhNhann

I think the reason is that while doing crowd counting, we do not need deep features which contains semantic information. These semantic information might influence the performance since we mainly need shallower feature like edges.

doubbblek avatar Feb 08 '20 07:02 doubbblek

@doubbblek Do you have a paper relevant mention about this? thanks for your answer

ThanhNhann avatar Mar 24 '20 02:03 ThanhNhann