DeepEmbeddingModel_ZSL
DeepEmbeddingModel_ZSL copied to clipboard
Statistical significance of pairs of methods
In the tables, you compare all the methods in the conventional and generalized setting. So, is there any way to measure the statistical significance of pairs of methods ? In that case, over how many trials the averaged accuracy has been found ? If we know the accuracy results of different trials, I think we can we find the significance level (p-value). Or are they just evaluated on some fixed large batch of test dataset without any trials ?