DomainMix

about real dataset (labeled or unlabeled)

Open lx7555 opened this issue 2 years ago • 7 comments

Thanks for your work. I noticed that you validated the importance of using unlabeled real datasets. How do I control, in the code, whether the real data is treated as labeled or unlabeled during training?

Thank you!

lx7555 avatar Aug 20 '22 01:08 lx7555

They should be controlled by train.py:

train_loader = get_train_loader(args, dataset_real, args.height, args.width,
    args.batch_size, args.workers, args.num_instances, iters,
    trainset=pseudo_labeled_dataset_relabel + dataset_syne.train)

pseudo_labeled_dataset_relabel is the real dataset used as unlabeled data (with pseudo labels), and dataset_syne.train is the synthetic dataset used with its labels.

WangWenhao0716 avatar Aug 20 '22 06:08 WangWenhao0716

Thank you. So, following your suggestion,

train_loader = get_train_loader(args, dataset_real, args.height, args.width,
    args.batch_size, args.workers, args.num_instances, iters,
    trainset=pseudo_labeled_dataset_relabel)

means the real data is used as the unlabeled dataset (pseudo_labeled), while

train_loader = get_train_loader(args, dataset_real, args.height, args.width,
    args.batch_size, args.workers, args.num_instances, iters,
    trainset=pseudo_labeled_dataset_relabel + dataset_syne.train)

means the real data is used as the labeled dataset (real_labeled)?

lx7555 avatar Aug 20 '22 09:08 lx7555

No.

train_loader = get_train_loader(args, dataset_real, args.height, args.width,
    args.batch_size, args.workers, args.num_instances, iters,
    trainset=pseudo_labeled_dataset_relabel)

means the real data is used as the unlabeled dataset (pseudo_labeled).

train_loader = get_train_loader(args, dataset_real, args.height, args.width,
    args.batch_size, args.workers, args.num_instances, iters,
    trainset=pseudo_labeled_dataset_relabel + dataset_syne.train)

means the real data is used as the unlabeled dataset (pseudo_labeled), while the synthetic data is used as the labeled dataset.

WangWenhao0716 avatar Aug 20 '22 12:08 WangWenhao0716

Thank you! Your paper mentions "adding labeled real-world data to synthetic data". How do I use that setting? That is, how do I change the label state of the real-world data? Best wishes

lx7555 avatar Aug 22 '22 01:08 lx7555

Maybe this?

train_loader = get_train_loader(args, dataset_real, args.height, args.width,
    args.batch_size, args.workers, args.num_instances, iters,
    trainset=dataset_real.train + dataset_syne.train)

means the real data is used with its labels (real-labeled), and the synthetic data is also used with its labels.

lx7555 avatar Aug 22 '22 01:08 lx7555

Yes.

WangWenhao0716 avatar Aug 22 '22 06:08 WangWenhao0716

Thank you

lx7555 avatar Aug 22 '22 08:08 lx7555
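
To summarize the thread: the three settings differ only in what is passed as trainset. Below is a minimal sketch of that choice. It assumes each dataset is a plain Python list of (image_path, identity_label, camera_id) tuples, so that + concatenates them. The tuple layout, the select_trainset helper, the mode names, and the toy samples are all hypothetical and are not part of the repository; only the expressions pseudo_labeled_dataset_relabel, dataset_real.train, and dataset_syne.train come from the comments above.

```python
from typing import List, Tuple

# Hypothetical sample layout: (image_path, identity_label, camera_id).
Sample = Tuple[str, int, int]


def select_trainset(
    mode: str,
    pseudo_labeled: List[Sample],
    real_labeled: List[Sample],
    synthetic: List[Sample],
) -> List[Sample]:
    """Return the list to pass as `trainset` for one of the three settings.

    The mode names are made up for this sketch; they are not flags of train.py.
    """
    if mode == "real_unlabeled":
        # trainset=pseudo_labeled_dataset_relabel
        return list(pseudo_labeled)
    if mode == "real_unlabeled+synthetic":
        # trainset=pseudo_labeled_dataset_relabel + dataset_syne.train
        return list(pseudo_labeled) + list(synthetic)
    if mode == "real_labeled+synthetic":
        # trainset=dataset_real.train + dataset_syne.train
        return list(real_labeled) + list(synthetic)
    raise ValueError(f"unknown mode: {mode!r}")


# Toy stand-ins for the real objects (values are invented).
pseudo = [("real/0001.jpg", 0, 0), ("real/0002.jpg", 1, 1)]   # pseudo labels
real = [("real/0001.jpg", 17, 0), ("real/0002.jpg", 42, 1)]   # ground-truth labels
syn = [("syn/0001.jpg", 100, 0)]                               # synthetic, labeled

for mode in ("real_unlabeled", "real_unlabeled+synthetic", "real_labeled+synthetic"):
    trainset = select_trainset(mode, pseudo, real, syn)
    print(mode, len(trainset))
```

In the actual script, the selected list would take the place of the trainset argument in the get_train_loader call; the remaining arguments stay as shown in the comments above.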