AttrScore
AttrScore copied to clipboard
lower in the F1 score obtained by using the fine-tuned FLAN-T5-XL and FLAN-T5-large models
Hi, Thank you for providing the fine-tuned models in the repository. I used the inference_alpaca.py code to evaluate the FLAN-T5-XL and FLAN-T5-large models on simulation dataset. However, the F1 score that I am getting are lower than what has been reported in the repository. Can you tell me if there is some setting that needs to be changed?
Following are the number that I am getting on running the inference: FLAN-T5-large (reported) | 57.3. | 50.1 | 70.5 FLAN-T5-large (obtained) | 53. | 49 | 57
Thanks, Sonam Gupta