open-operator-evals
open-operator-evals copied to clipboard
Opensource benchmark evaluating web operators/agents performance
Results
0
open-operator-evals issues
Sort by
recently updated
recently updated
newest added