Dev branch for the ToolUseAgent

Open • TLSDC opened this issue 7 months ago • 1 comment

Comes in combination with this bgym PR: https://github.com/ServiceNow/BrowserGym/pull/340

Description by Korbit AI

What change is being made?

Introduce a new ToolUseAgent and supporting benchmark data, and replace existing usage of bgym.Benchmark and bgym.HighLevelActionSetArgs with the newly defined Benchmark and HighLevelActionSetArgs from agentlab.experiments.benchmark.
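The import swap described above can be sketched as a small source-rewriting helper. This is only an illustration: `migrate_source`, `MOVED_NAMES`, and `NEW_MODULE` are hypothetical names that are not part of AgentLab or BrowserGym, and the sketch assumes call sites reference the classes as `bgym.Benchmark` and `bgym.HighLevelActionSetArgs`.

```python
import re

# Names the PR description says now come from agentlab.experiments.benchmark.
MOVED_NAMES = ("Benchmark", "HighLevelActionSetArgs")
NEW_MODULE = "agentlab.experiments.benchmark"


def migrate_source(source: str) -> str:
    """Rewrite `bgym.<Name>` references to bare names and prepend the new import.

    Hypothetical helper for illustration only; it is not part of either library.
    """
    used = []
    for name in MOVED_NAMES:
        pattern = re.compile(rf"\bbgym\.{name}\b")
        if pattern.search(source):
            # Drop the `bgym.` prefix so the name resolves via the new import.
            source = pattern.sub(name, source)
            used.append(name)
    if not used:
        # Nothing to migrate: return the source unchanged.
        return source
    import_line = f"from {NEW_MODULE} import {', '.join(used)}\n"
    return import_line + source


if __name__ == "__main__":
    before = (
        "import bgym\n"
        "def run(benchmark: bgym.Benchmark, actions: bgym.HighLevelActionSetArgs):\n"
        "    ...\n"
    )
    print(migrate_source(before))
```

A real migration would likely edit imports by hand or with a syntax-aware tool; the regular-expression approach here only shows the shape of the change at each call site.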

Why are these changes being made?

These changes extend the agent system with a ToolUseAgent, which uses tool descriptions to perform actions. The new Benchmark and HighLevelActionSetArgs classes make benchmarking configurations more consistent and modular, which should make the agents and their test environments easier to scale and adapt.

TLSDC avatar Apr 23 '25 18:04 TLSDC

Based on your review schedule, I'll hold off on reviewing this PR until it's marked as ready for review. If you'd like me to take a look now, comment /korbit-review.

Your admin can change your review schedule in the Korbit Console

korbit-ai[bot] avatar Apr 23 '25 18:04 korbit-ai[bot]

This PR is stale and has been merged earlier.

amanjaiswal73892 avatar Jul 14 '25 21:07 amanjaiswal73892