Self_Explaining_Structures_Improve_NLP_Models
Hyperparameters to reproduce the same results for roberta-base as mentioned in the paper
What are the exact hyperparameters needed for the model to reproduce the results reported in the paper? Since the seed is fixed, I expected that running the command you mentioned would give the accuracy reported in the paper on SST-5, i.e. 57%, but it does not.