Ryan H. Tran issues

Results: 28 issues of Ryan H. Tran

Support MINT benchmark (MATH, GSM8K subset)

This PR provides a draft evaluation integration for the MINT benchmark which tests the agent's ability to solve tasks with multi-turn interactions. This benchmark tests the agent's ability of code...

evaluation

Enhance code agent's search ability with ACR's context search API

**What problem or use case are you trying to solve?** The current search skills available to the agent are: ```python - search_dir(search_term, dir_path='./'): # Searches for a term in all...

enhancement

Stale

feat(workflow): Implement a simplified CoAct workflow

**Short description of the problem this fixes or functionality that this introduces. This may be used for the CHANGELOG** - This PR implements a simplified multi-agent workflow inspired by the...

enhancement

agent framework

[Bug]: (eval) Command execution error when retrying after rate limit error

### Is there an existing issue for the same bug? - [X] I have checked the troubleshooting document at https://docs.all-hands.dev/modules/usage/troubleshooting - [X] I have checked the existing issues. ### Describe...

bug

[Bug]: (eval) Instance results with llm proxy `OpenAIException` errors got merged into output.jsonl

bug

evaluation

severity:medium

[Experimental] Integrate repomap

Move linter and diff utils to openhands-aci

[Experiment] Add symbol navigation commands into the editor

[Experimental] Screenshot-based browsing

Upgrade `openhands-aci` to v0.1.2