DiffusionDet icon indicating copy to clipboard operation
DiffusionDet copied to clipboard

The signal scaling in the training stage

Open pengxinliu opened this issue 2 years ago • 3 comments

Hi,

Thanks for sharing your wonderful work. I can't understand the signal scaling equation pb = (pb * 2 - 1) * scale in the training stage, can you explain the reason why transform pb by this equation in more detail?

Many thanks!

pengxinliu avatar Nov 22 '22 11:11 pengxinliu

Hi,

Thanks for your interest in our work.

The raw signals~(eg, image pixel, ground truth boxes coordinates) should be scaled to some range before combination with noise. Taking image generation, for example, raw pixel values are typically scaled to [-1, 1].

We also conduct ablation studies on this scaling value in Table 3 (a) in our manuscript. A more detailed discussion can be found in Sec 4.4 Signal scaling.

ShoufaChen avatar Nov 22 '22 13:11 ShoufaChen

I have the same question. According to your references in this part, "Analog Bits" and panoptic paper convert the input data to bits signal. Here you just normalized the ground truth info to [0, 1] then conduct the scale. Is it the same concept or you just want to do the same operation? I think the scale is just to translate the center and scale the height and width of boxes. Please help me to figure out this. Thank you ~!

kaneyxx avatar Nov 30 '22 07:11 kaneyxx

I also have a question about signal scaling, the codes below are added after the combination of signal and noise, x = torch.clamp(x, min=-1 * self.scale, max=self.scale) x = ((x / self.scale) + 1) / 2. can you explain why need to scale the noisy mask again?

aoyanl avatar Jan 09 '23 13:01 aoyanl