Add configurable metric selection and multi-metric display in Evaluation UI

Open Aki-07 opened this issue 2 months ago • 3 comments

The evaluation UI currently supports only ROUGE, but the backend (adk-python) already allows multiple metrics through RunEvalRequest and exposes them via /metrics-info. This issue adds UI support for dynamically selecting metrics (e.g., ROUGE, BERTScore, LLM-as-judge, path accuracy) and displaying multiple evaluation results.

No backend changes are required; only the frontend needs updates to:

  • Fetch available metrics from /metrics-info
  • Add a metric selection dropdown in the evaluation panel
  • Include selected metrics in the /eval-sets/{eval_set_id}/run request payload
  • Render multiple metric results dynamically in the results view
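A rough sketch of the payload-building and result-grouping steps above, in TypeScript. All interface and field names here (`MetricInfo`, `evalMetrics`, `threshold`, `evalId`, and so on) are assumptions made for illustration; the actual shapes returned by /metrics-info and accepted by RunEvalRequest should be checked against adk-python before implementing.

```typescript
// Hypothetical shape of one entry returned by /metrics-info (assumed, not confirmed).
interface MetricInfo {
  metricName: string;
  description?: string;
}

// Hypothetical shape of one metric in the run request payload (assumed).
interface EvalMetric {
  metricName: string;
  threshold: number;
}

// Hypothetical request body for /eval-sets/{eval_set_id}/run (assumed).
interface RunEvalPayload {
  evalIds: string[];
  evalMetrics: EvalMetric[];
}

// Hypothetical flat result row, one per (eval case, metric) pair (assumed).
interface MetricResult {
  evalId: string;
  metricName: string;
  score: number;
  threshold: number;
}

// One cell in the results view: a metric's score and whether it met its threshold.
interface MetricCell {
  metricName: string;
  score: number;
  passed: boolean;
}

// Build the run payload from the dropdown selection, dropping any metric
// name that the backend did not advertise via /metrics-info.
function buildRunEvalPayload(
  available: MetricInfo[],
  selected: string[],
  evalIds: string[],
  thresholds: Record<string, number> = {},
  defaultThreshold = 0.5,
): RunEvalPayload {
  const known = new Set(available.map((m) => m.metricName));
  const evalMetrics = selected
    .filter((name) => known.has(name))
    .map((name) => ({
      metricName: name,
      threshold: thresholds[name] ?? defaultThreshold,
    }));
  return { evalIds, evalMetrics };
}

// Group flat result rows by eval case so the results view can render
// one row per case with a dynamic number of metric columns.
function groupResultsByEvalCase(
  results: MetricResult[],
): Map<string, MetricCell[]> {
  const grouped = new Map<string, MetricCell[]>();
  for (const r of results) {
    const cells = grouped.get(r.evalId) ?? [];
    cells.push({
      metricName: r.metricName,
      score: r.score,
      passed: r.score >= r.threshold,
    });
    grouped.set(r.evalId, cells);
  }
  return grouped;
}
```

Keeping both steps as pure functions would let the selection and rendering logic be unit tested without a running backend; the HTTP calls to /metrics-info and the run endpoint can then live in whichever service adk-web already uses for evaluation requests.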

Aki-07 avatar Oct 16 '25 17:10 Aki-07

@boyangsvl As discussed, continuing here. Could you tag the right people?

Aki-07 avatar Oct 16 '25 17:10 Aki-07

I've assigned this issue to you. Thanks!

boyangsvl avatar Oct 16 '25 17:10 boyangsvl

Cool, thanks @boyangsvl!

Aki-07 avatar Oct 16 '25 17:10 Aki-07