fast-rcnn icon indicating copy to clipboard operation
fast-rcnn copied to clipboard

where is the prototxt file matched to imagenet models?

Open liuwenran opened this issue 9 years ago • 11 comments

I've tried to test vgg16.v2.caffemodel with VGG16's test.prototxt in demo.py, but its output is incorrect. I wonder where is the corresponding prototxt file of vgg16.v2.caffemodel?

liuwenran avatar Sep 17 '15 08:09 liuwenran

Hi, Were you able to find these prototxt files for the imagenet models?Did you write it yourself? I need them as well.

sid027 avatar Oct 14 '15 10:10 sid027

@sid027 I changed the num_output of "cls_score" layer to 201 and the num_output of "bboxes_pred" layer to 804 in train.prototxt. And I trained it on imagenet dataset.

liuwenran avatar Oct 14 '15 10:10 liuwenran

is there a particular reason you used 201/804.....do you have 201 classes?

sid027 avatar Oct 14 '15 12:10 sid027

@sid027 because there are 200 classes in imagenet dataset ,and 201 classes with “background” ,804 = 201 * 4 are bounding boxes

liuwenran avatar Oct 14 '15 12:10 liuwenran

one more question on the bounding boxes.Are they part of the training.Can I give 1000 instead of 804.Is there some specific rule that it needs to be four times?

sid027 avatar Oct 14 '15 12:10 sid027

because a bounding box is [xmin,ymin,xmax,ymax],there are four numbers for a box. [xmin,ymin] is the left-top pixel location of this box in original image ,and [xmax,ymax] is the right-bottom.

liuwenran avatar Oct 14 '15 12:10 liuwenran

ahh!thanks a lot.I will try this.

sid027 avatar Oct 14 '15 12:10 sid027

@liuwenran have a question of image database structure.Do I put each class in a subfolder?What about the background?

sid027 avatar Oct 14 '15 20:10 sid027

@sid027 you needn't to put each class in a subfolder because images will be shuffled before training,and background will be generated automatically.

liuwenran avatar Oct 15 '15 08:10 liuwenran

Hi @liuwenran! I've been trying to train this with the imagenet dataset, but have no idea on how to build the dataset (I have the JPEG files and Annotations, etc), but cant understand how to build the imdb.

any pointers?

alilemus avatar Nov 07 '15 11:11 alilemus

I modified the corresponding numbers and run the demo, and found out that the result is very bad. what is the possible reason? Does anyone encounter such an problem? thanks!

crazylyf avatar Feb 23 '17 05:02 crazylyf