deconfounder_tutorial
deconfounder_tutorial copied to clipboard
Standardization should be done using mean and variance of the training data only
Standardization is done using the mean and variance of the whole dataset (including holdout) in Step 1.1. For the holdout to remain totally unseen, we cannot use the information in the holdout to help standardize the data. The standardization for both training and holdout datasets should be done using only the training's mean and variance.