omni-detr icon indicating copy to clipboard operation
omni-detr copied to clipboard

Confusion about unlabeled data

Open SayBender opened this issue 2 years ago • 4 comments

Dear Authors,

Thanks for the paper. I used the scripts to split the data into label and unlabel(omni-label) but looking at unlabeled, I don't understand why bounding boxes still exist in the unlabel dataset!? What if I want to omni-label brand new data? The whole point of omni label is not to create bounding boxes, right? So why in the ...unlabel...json we still see bbox and segmentations fully populated? I know in the images side the label type is reflected (Unsup, tagsU, etc.) but still on the annotation side, it confuses me a lot seeing annotations for what is supposed to be semi-supervised tag only or None.

I have these 2 settings: ..._omni_unlabel_seed1709_10fully0Unsup90tagsU0tagsK0pointsU0pointsK0boxesEC0boxesU.json ..._omni_unlabel_seed1709_50fully50Unsup0tagsU0tagsK0pointsU0pointsK0boxesEC0boxesU.json

"annotations": [ { "area": 625848.0, "bbox": [ 684, 174, 1068, 586 ], "category_id": 1, "id": 1, "image_id": 0, "iscrowd": 0, "point": [ 1200.0, 346.0 ], "segmentation": [ [ 684, 174, 1752, 174, 1752, 760, 684, 760 ] ] }, {

SayBender avatar Sep 08 '22 14:09 SayBender

BTW, I only have 1 class. Will that cause any issues? I appreciate any suggestions.

SayBender avatar Sep 08 '22 14:09 SayBender

Bounding boxes still exist but we didn't use them on your two cases. Please see Line147-184 of engine.py. We route each examples according to their label types. If you only have 1 class, is it similar to bees dataset?

peiwang062 avatar Sep 08 '22 20:09 peiwang062

Thank you @peiwang062 ,

I have a similar question.

What I want to know is how can I annotate new Unsup data? And new tagsU data? Not from splitting already fully annotated datasets for proof of concept purposes. But actually adding brand new data (images) with no annotation, or with image-level annotations only.

Do you mean if I go to the json annotations (unlabel) and change all bbox and segmentations to [], then everything will still be exactly the same?

amirhesamyazdi avatar Sep 09 '22 04:09 amirhesamyazdi

I think a simplest way is to organize your customized data and its annotation to coco format.

For fully labeled data, we still need its bbox, so can't let annotation []. for omni-labeled data, it depends on the annotation type. If we don't use bbox, yeah, I think setting the corresponding annotation to [] won't change the results.

peiwang062 avatar Sep 09 '22 05:09 peiwang062