relu_networks_overconfident icon indicating copy to clipboard operation
relu_networks_overconfident copied to clipboard

Some questions about data processing.

Open jimo17 opened this issue 2 years ago • 1 comments

Thanks for open source. I am very interested in your paper. I have a question. Why is line 144 in train.py is [x,x,x] instead of just one x?

https://github.com/max-andr/relu_networks_overconfident/blob/ce2d3a1ab8434cdb46a2d20da291411052474636/train.py#L144

jimo17 avatar Oct 20 '22 07:10 jimo17

hi,

if i remember correctly, it's just needed to make sure that we can have more adversarial (adv) samples than the batch size (e.g., up to 3x more). note that we anyway subsample the generated data via [:n_adv]. so it's a very ad-hoc trick to make things work. definitely, a much cleaner solution must exist (and the factor of 3x is not somehow special; it's just "large enough" for practical values of n_adv that we were interested in).

best, maksym

max-andr avatar Oct 20 '22 08:10 max-andr