BERT-of-Theseus
BERT-of-Theseus copied to clipboard
CoLA reproducibility
Hi, I cannot reproduce the CoLA score as same as the one on paper. I followed the HuggingFace's repo to train a predecessor model with Matthew correlation score of 55.76. However, the hightest score of the successor model I got is 35.82. Could you provide the hyperparameter set for training on the CoLA dataset?
Hi @mcps5601, have you solved the issue?
I train a predecessor model with Matthew correlation of 58.04 on the dev set but could not reproduce the result of successor. The highest score I get is about 44. I try to tune the learning_rate
and scheduler_linear_k
but it looks like they do not refine the results too much.
CC @JetRunner
Hi @mcps5601, have you solved the issue? I train a predecessor model with Matthew correlation of 58.04 on the dev set but could not reproduce the result of successor. The highest score I get is about 44. I try to tune the
learning_rate
andscheduler_linear_k
but it looks like they do not refine the results too much. CC @JetRunner
May I ask how your parameters are configured