on-device-dg

Question on splits of dataset DOSCO

Open • hxynjk opened this issue 2 years ago • 1 comment

Hi! Thanks for this great contribution to domain generalization. I noticed that you provide a domain label for each sample in DOSCO, as well as a random split into "train", "valid", and "test" sets. Did you take the domain labels into account when creating the "train", "valid", and "test" splits?

I also noticed that the domain labels range from 0 to 9, which makes it hard to split the samples of each domain separately. If we do this, some domains end up with too few samples to train the network.

Thanks for your reply! Thanks for sharing the public dataset.

hxynjk • Feb 08 '23 13:02

The domain labels in DOSCO are not supposed to be used. You can treat DOSCO like the "ImageNet -> ImageNet-V2 / Sketch / Rendition" setting.

KaiyangZhou • Feb 09 '23 02:02
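
For readers who still want to see how the domain labels are distributed across the provided splits, here is a minimal sketch. It assumes the annotations have already been loaded as `(split_name, domain_label)` pairs; that record format and the helper names `count_domains_per_split` and `sparse_domains` are hypothetical, since the DOSCO file layout is not described in this thread. The default of 10 domains follows the 0 to 9 label range mentioned in the question.

```python
from collections import Counter, defaultdict


def count_domains_per_split(records):
    """Count samples per domain label within each split.

    `records` is an iterable of (split_name, domain_label) pairs.
    This input format is an assumption, not the DOSCO file format.
    """
    counts = defaultdict(Counter)
    for split, domain in records:
        counts[split][domain] += 1
    return {split: dict(counter) for split, counter in counts.items()}


def sparse_domains(counts, min_samples, domains=range(10)):
    """List, per split, the domains with fewer than `min_samples` samples.

    Domains that never appear in a split are counted as zero samples.
    """
    domains = list(domains)
    return {
        split: sorted(d for d in domains if per_domain.get(d, 0) < min_samples)
        for split, per_domain in counts.items()
    }


if __name__ == "__main__":
    # Toy records for illustration only; not real DOSCO data.
    toy = [("train", 0), ("train", 0), ("train", 1), ("valid", 0), ("test", 1)]
    per_split = count_domains_per_split(toy)
    print(per_split)
    print(sparse_domains(per_split, min_samples=2, domains=range(2)))
```

A check like this only makes the sparsity concern from the question concrete; per the maintainer's reply, the intended protocol ignores the domain labels altogether.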