Ryan H. Tran comments

Results 87 comments of


                                            Ryan H. Tran

Upgrade openhands-aci to 0.1.7

Ran an eval on the 30 instances above locally, the result looks reasonable (baseline got 13/30). CC @xingyaoww

Upgrade openhands-aci to 0.1.7

No, the ordering fix doesn't go into this release. This only contains your fix

Upgrade openhands-aci to 0.1.7

@xingyaoww Running eval after adding the sorting fix and this pending PR: https://github.com/All-Hands-AI/openhands-aci/pull/51, now we get 12/30 compared to 13/30:

[Bug]: Discrepancy between Openrouter's deepseek-r1 and deepseek-reasoner

Yeah, indeed there seems to be a relevant issue there: https://github.com/BerriAI/litellm/issues/8193

Proposal: Simplify microagents + support MCP natively

> In addition, can't we just refer to the MCP in the microagent content? How do you envision the frontmater processing for this particular field? I think `mcp_location` is the...

[Bug]: (eval) Instance results with llm proxy `OpenAIException` errors got merged into output.jsonl

Unfortunately from the trajectory in the jsonl file there're no traceback. There's only one last entry from the `history` field beside the `error` field above. I can try capturing the...

[Bug]: (eval) Instance results with llm proxy `OpenAIException` errors got merged into output.jsonl

Thanks for the fix! Btw can you explain why retrying the whole eval is better? Not sure about the architectural side, but imo it may be not necessary to run...

[Bug]: (eval) Instance results with llm proxy `OpenAIException` errors got merged into output.jsonl

Yeah, from my side I can see the retries happen after your fix. Recently with the new LLM proxy I don't even receive 502 errors anymore. Maybe this PR can...

[Experimental] Integrate repomap

Yep sounds good! Thanks for the idea, I'll have a closer look.

Upgrade `openhands-aci` to v0.1.2

Hmm... Now the regex parsing causes the test with ipython code containing multiple `file_editor` calls fail: https://github.com/All-Hands-AI/OpenHands/blob/9908e1b28525fe96394446be95fcb00785d0ca0c/tests/runtime/test_ipython.py#L278-L290