open-operator-evals
open-operator-evals copied to clipboard
Opensource benchmark evaluating web operators/agents performance