AttrScore icon indicating copy to clipboard operation
AttrScore copied to clipboard

lower in the F1 score obtained by using the fine-tuned FLAN-T5-XL and FLAN-T5-large models

Open GuptaSonam opened this issue 1 year ago • 1 comments

Hi, Thank you for providing the fine-tuned models in the repository. I used the inference_alpaca.py code to evaluate the FLAN-T5-XL and FLAN-T5-large models on simulation dataset. However, the F1 score that I am getting are lower than what has been reported in the repository. Can you tell me if there is some setting that needs to be changed?

Following are the number that I am getting on running the inference: FLAN-T5-large (reported) | 57.3. | 50.1 | 70.5 FLAN-T5-large (obtained) | 53. | 49 | 57

Thanks, Sonam Gupta

GuptaSonam avatar Jul 31 '23 12:07 GuptaSonam