Ryan H. Tran comments

Results 87 comments of


                                            Ryan H. Tran

feat(workflow): Implement a simplified CoAct workflow

> Hey, thanks a bunch for this @ryanhoangt ! > > I browsed through the code, and I think it's implemented quite well. Personally I think the next step could...

feat(workflow): Implement a simplified CoAct workflow

> It might be in the paper(s), but I don't quite like that the prompts now talk of `agent`, while anywhere else it is `assistant`. 🤔 Make sense, tho i...

feat(workflow): Implement a simplified CoAct workflow

@neubig Hi Prof., till now I tested on a few (13) swe-bench instances that are mutual between `swe-bench-lite` and `swe-bench-verified`, using max same 30 turns: - CoAct can resolve 8/13...

feat(workflow): Implement a simplified CoAct workflow

After running the eval on a subset of 93 instances, it's quite disappointing to see that the performance for now is pretty bad 😢. CoAct only resolved 25/93 while CodeAct...

feat(workflow): Implement a simplified CoAct workflow

> From the outside, it is a bit surprising that it is not at least equal in performance, imo. It's quite unexpected to me as well. I will upload the...

feat(workflow): Implement a simplified CoAct workflow

@ketan1741 @tobitege I uploaded the trajectory to my viz [here](https://huggingface.co/spaces/ryanhoangt/evaluation?filepaths=outputs%2Fswe_bench_lite%2FCoActPlannerAgent%2Fclaude-3-5-sonnet%4020240620_maxiter_40_N_v1.0-no-hint%2Foutput.jsonl), as uploading a subset of eval on OpenHands's official space may confuse people. Maybe we can have a look to...