Quick Question: what is the actual training dataset volume?

Open bsun0802 opened this issue 1 year ago • 1 comments

Hi Author,

Thanks for the great work. I've been curious in the "ID-Oriented Human Data Construction" described in paper, after ID verification and other filtering, what is the final training dataset volume?

I'm curious about how many training data, will empower the ID-preservation for a diffusion model, e.g., 10k, 100k, or 1M, etc. Would love to see an ablation for how the model progress when fed with different magnitude of data size.

Thanks!

Jan 19 '24 22:01 bsun0802

Hi, for the model reported in the paper, we used filtered 110K images to train. No in-depth exploration of changes in the scale of training datasets.

Jan 20 '24 06:01 Paper99