Can we assign scores instead of boolean logic for validation?

Open kimianoorbakhsh opened this issue 1 year ago • 3 comments

Hello,

I am working on a task that can be evaluated with some score function such as F1-score. I noticed that the validation metrics currently are only boolean (for example, exact match, etc.). Is there a way that I can use numerical metrics? I would appreciate your help and guidance.

Thanks!

kimianoorbakhsh, Oct 27 '23 14:10

Yes, you can. See the other open issue about the same thing; it has my answer.

okhat, Oct 27 '23 15:10

See issue #145

Does it address what you have in mind or would you like something richer?

okhat, Oct 28 '23 22:10

> See issue #145
>
> Does it address what you have in mind or would you like something richer?

Thanks for your answer. Yes, but what I was thinking of was using something like a BLEU or F1 score as the metric. I don't need the model's outputs to match their corresponding references in the dataset exactly; it is acceptable if they are reasonably similar.

kimianoorbakhsh, Nov 01 '23 18:11
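
For readers landing here with the same question, below is a minimal sketch of a numeric metric based on token-level F1. It is not taken from issue #145. It assumes the metric is a plain function with the `(example, pred, trace=None)` signature, which, as far as I know, is the convention DSPy metrics follow, and that it may return a float during evaluation. The `answer` field name, the `threshold` parameter, and the whitespace tokenization are hypothetical choices made for illustration.

```python
from collections import Counter
from types import SimpleNamespace


def token_f1(prediction: str, reference: str) -> float:
    """Token-level F1 between two strings (lowercased, whitespace-split)."""
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


def f1_metric(example, pred, trace=None, threshold=0.5):
    """Numeric metric sketch; `answer` and `threshold` are assumptions."""
    score = token_f1(pred.answer, example.answer)
    # Assumption: when a trace is passed (e.g. while selecting demonstrations),
    # a pass/fail decision is more useful than a raw score.
    if trace is not None:
        return score >= threshold
    return score


# Stand-in objects for illustration; real code would pass DSPy examples and predictions.
example = SimpleNamespace(answer="the cat sat on the mat")
pred = SimpleNamespace(answer="a cat sat on a mat")
print(round(f1_metric(example, pred), 3))
```

Because the function returns a float when no trace is supplied, averaging it over a dev set gives a graded score rather than an exact-match rate. A BLEU-based variant would only need to replace `token_f1` with a BLEU implementation of your choice.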