py-faster-rcnn

How can I train faster-rcnn-end2end if one window can have multiple labels?

Open ankurgupta7 opened this issue 8 years ago • 5 comments

ankurgupta7 avatar Oct 24 '16 23:10 ankurgupta7

How can I do that?

ankurgupta7 avatar Oct 25 '16 00:10 ankurgupta7

Hi @ankurgupta7 ,

Can you elaborate a bit more on the problem? Ideally, when specifying annotations in 'annotations.txt', you just give the object class along with the coordinates of the top-left and bottom-right corners of the bounding box for each object. Overlapping windows won't be an issue.

rohitghosh avatar Oct 26 '16 14:10 rohitghosh

Yeah, true, but I don't have a dataset that has all the labels for each image. I have some images with label a and some with label b. Ideally, all the images would have had both labels a and b. If I train with such a dataset, the loss function would penalize 'a' on images labeled 'b', because it assumes the two labels are mutually exclusive. Am I making any sense? TL;DR: my dataset is incomplete. How can I avoid penalizing labels that should have been present in the training dataset but are not?
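To make the idea concrete, here is a minimal NumPy sketch of a per-class sigmoid cross-entropy that skips entries whose label is unknown. It is an illustration of the concept only, not code from py-faster-rcnn: the function name `masked_sigmoid_cross_entropy` and the `known_mask` argument are hypothetical, and it assumes you can tell, for each RoI, which classes were actually annotated in its image.

```python
import numpy as np

def masked_sigmoid_cross_entropy(scores, targets, known_mask):
    """Mean sigmoid cross-entropy over annotated (RoI, class) entries only.

    scores:     (N, K) raw class scores (logits), one row per RoI
    targets:    (N, K) 0/1 targets, 1 where the class is present
    known_mask: (N, K) 1 where the class was annotated for that image,
                0 where the label is simply missing (hypothetical input)
    """
    scores = np.asarray(scores, dtype=np.float64)
    targets = np.asarray(targets, dtype=np.float64)
    known_mask = np.asarray(known_mask, dtype=np.float64)

    # Numerically stable form of -[t*log(s) + (1-t)*log(1-s)], s = sigmoid(x):
    #   max(x, 0) - x*t + log(1 + exp(-|x|))
    per_entry = (np.maximum(scores, 0.0)
                 - scores * targets
                 + np.log1p(np.exp(-np.abs(scores))))

    # Entries with an unknown label contribute nothing to the loss.
    masked = per_entry * known_mask

    # Average over the known entries; guard against an all-zero mask.
    return masked.sum() / max(known_mask.sum(), 1.0)
```

With two classes a and b, an RoI from an image annotated only for a would get the mask row `[1, 0]`, so whatever score the network gives to b on that RoI does not change the loss.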

ankurgupta7 avatar Oct 28 '16 16:10 ankurgupta7

@rohitghosh my question is essentially about converting the SoftmaxLoss in the "loss_cls" layer to SigmoidCrossEntropy. It fails due to a size mismatch between {bottom: "cls_score", bottom: "labels"}.
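The likely cause of the mismatch, sketched under assumptions: a softmax loss takes one integer class index per RoI, so "labels" holds N values, while a sigmoid cross-entropy loss compares scores and targets element by element, so the targets need the same N x K shape as "cls_score". The helper below (the name `labels_to_multi_hot` is hypothetical) expands integer labels into that shape. It keeps column 0 for the background class, which is an assumption about how you want to treat background.

```python
import numpy as np

def labels_to_multi_hot(labels, num_classes):
    """Expand integer class labels of shape (N,) into 0/1 targets of shape (N, K)."""
    labels = np.asarray(labels, dtype=np.int64)
    targets = np.zeros((labels.shape[0], num_classes), dtype=np.float32)
    # Set a single 1 per row at the column given by the label.
    targets[np.arange(labels.shape[0]), labels] = 1.0
    return targets

# Example: three RoIs, three classes (0 = background in this sketch).
targets = labels_to_multi_hot([0, 2, 1], num_classes=3)
print(targets.shape)  # (3, 3), matching an (N, K) score blob
```

For a window that really has several labels, more than one entry per row would be set to 1, which is the case a softmax over classes cannot represent.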

ankurgupta7 avatar Nov 04 '16 04:11 ankurgupta7

@ankurgupta7 Hi, did you solve this problem?

ycxia avatar Apr 23 '19 01:04 ycxia