GroupRCNN icon indicating copy to clipboard operation
GroupRCNN copied to clipboard

sampling rule for the fully-labeled dataset

Open qjadud1994 opened this issue 3 years ago • 0 comments

Thank you for sharing your great work, GroupRCNN for WSSOD.

As in the below figure, GroupRCNN is trained with a subset (10% or 20% or 50%) of COCO fully-labeled datasets. image

Are there any rules or algorithms to sample a subset of COCO fully-labeled datasets? For example, when randomly sampling 10% of fully-labeled datasets, the minor (long-tailed) classes are rarely included in the 10% sampled dataset.

Did you consider the distribution of categories when sampling the COCO dataset or do you have any special rules for sampling the dataset?

qjadud1994 avatar Aug 04 '22 04:08 qjadud1994