oso
oso copied to clipboard
Arize Phoenix setup with a benchmark
What is it?
We should design our own benchmark using the eval system. See the work we’re also doing on Dot https://github.com/opensource-observer/oso/issues/3602
I think this is mostly done