deep_Mahalanobis_detector
deep_Mahalanobis_detector copied to clipboard
Baselines comparison
You merged the in-distribution and out-of-distribution test set and split out new train/val/test set for LR based on Mahalanobis score. However, you don't do it in the same way for ODIN and temperature scaling. Is that fair? At least, I suppose you can use the same subset to report and compare AUC.