logfire LLM qualitative evaluations and labeling

Open Luca-Blight opened this issue 9 months ago • 2 comments

Description

It would be nice to have a place in the platform for LLM qualitative evaluations and labeling. Another option would be to allow integration with a partner that provides it.

Feb 19 '25 20:02 Luca-Blight

Yup, we're working on this very thing, see https://github.com/pydantic/pydantic-ai/issues/915 and the linked pull request.

Feb 19 '25 21:02 samuelcolvin

That's awesome to see!

One feature that could be interesting, particularly for online performance, is the ability to set up another model as the evaluator instead of a human.

Feb 19 '25 21:02 Luca-Blight
