CrowdDet icon indicating copy to clipboard operation
CrowdDet copied to clipboard

Label assignment in multi-classes prediction

Open zehuichen123 opened this issue 5 years ago • 12 comments

Hi, I am trying to implement CrowdDet on my own. However, I wonder what's your strategy when handling multiple class detection? In your code fpn_roi_target.py, I notice that you simply take the top-2 iou label as the target for each proposal, so maybe they can be two different classes?

zehuichen123 avatar Jun 05 '20 09:06 zehuichen123

The multiple class detection will be implemented in a few days. Please wait for the new implementation.

xg-chu avatar Jun 06 '20 14:06 xg-chu

Could you please simply describe your strategy so I can give it a try on my own? Thanks!

zehuichen123 avatar Jun 06 '20 14:06 zehuichen123

@Purkialo Another question, for refine module, did you repeat the bbox data 4 times for coco or simply one time, since in crowdhuman dataset, you repeated it 4 times which convert it to a 20 dim array. I tried to concatenate one 400(80 * (4 + 1)) dim features to original roi feature but ended up with NaN loss :(

zehuichen123 avatar Jun 10 '20 05:06 zehuichen123

We recommend that you only use the simple version of emd without refinement module. If you want to use the refinement module, I think you only need to concat the feature, and don’t repeat the features.

xg-chu avatar Jun 10 '20 16:06 xg-chu

Hi, similar with @zehuichen123, I am quite confused about why did you repeat the bbox data 4 times in refine model. Hope for your answer. Thanks a lot!

LearnerZhou avatar Jun 30 '20 11:06 LearnerZhou

Here is my guess: the original feature dim is 1024 and if we only append coordinates to the roi feature, only 4 + 1 dim vector will be appended, which means little influence on refinement results(5 vs 1024), so maybe repeating it multiple times is better, I think 4 here is an empirical value.

zehuichen123 avatar Jun 30 '20 14:06 zehuichen123

Wow, it sounds reasonable. Thanks for your warm help!

LearnerZhou avatar Jul 01 '20 07:07 LearnerZhou

Here is my guess: the original feature dim is 1024 and if we only append coordinates to the roi feature, only 4 + 1 dim vector will be appended, which means little influence on refinement results(5 vs 1024), so maybe repeating it multiple times is better, I think 4 here is an empirical value.

👍

xg-chu avatar Jul 01 '20 13:07 xg-chu

So, can one anchor predict two different classes?

taofuyu avatar Aug 03 '22 10:08 taofuyu

@Purkialo has the implementation of a single bounding box and multiple labels implemented? can u share the code @taofuyu i am also interested in this did u see any other repos like this

abhigoku10 avatar Aug 03 '22 10:08 abhigoku10

@Purkialo has the implementation of a single bounding box and multiple labels implemented? can u share the code @taofuyu i am also interested in this did u see any other repos like this

I think one anchor can predict multi classes, but it is not necessary. Because NMS is applied to classes one by one, if there are different-class objects predicted in one set, Set-NMS will be invalid

taofuyu avatar Aug 04 '22 01:08 taofuyu

@taofuyu ya the NMS will be invalid but what i looks for is multi label classification with single bounding box , i want the nms operation btw the different anchor boxes and its classes and not within them

abhigoku10 avatar Aug 04 '22 02:08 abhigoku10