deconfounder_tutorial icon indicating copy to clipboard operation
deconfounder_tutorial copied to clipboard

Standardization should be done using mean and variance of the training data only

Open johncai117 opened this issue 3 years ago • 0 comments

Standardization is done using the mean and variance of the whole dataset (including holdout) in Step 1.1. For the holdout to remain totally unseen, we cannot use the information in the holdout to help standardize the data. The standardization for both training and holdout datasets should be done using only the training's mean and variance.

johncai117 avatar May 31 '22 20:05 johncai117