Arize Phoenix setup with a benchmark

Open ryscheng opened this issue 8 months ago • 1 comment

We should design our own benchmark using the eval system. See also the related work we’re doing on Dot: https://github.com/opensource-observer/oso/issues/3602

Apr 21 '25 21:04 ryscheng

I think this is mostly done

Jun 09 '25 16:06 ryscheng