llama-recipes How to evaluate the summarization model performace using Rouge score

How to evaluate the summarization model performace using Rouge score

Open hxue3 opened this issue 1 year ago • 1 comments

I was able to replicate the quick start notebook. But I am not sure how to evaluate the fine tuned model's performance. Is there an embedded method for evaluation?

Nov 10 '23 00:11 hxue3

@hxue3 it has not been done in the notebook but you should be able to use the same function get_preprocessed_dataset(tokenizer, samsum_dataset, 'train') for validation and test and passing it as eval_dataset to the trainer.

Nov 10 '23 23:11 HamidShojanazeri

Hi! It seems that a solution has been provided to the issue and there has not been a follow-up conversation for a long time. I will close this issue for now and feel free to reopen it if you have any questions!

May 31 '24 18:05 wukaixingxp

llama-recipes llama-recipes copied to clipboard

How to evaluate the summarization model performace using Rouge score

llama-recipes
llama-recipes copied to clipboard