Ryan H. Tran

Results 87 comments of Ryan H. Tran

Okay finally the score is converging to what we want, thanks @enyst for all the improvement suggestions! On the subset of 93 verified instances, CoAct resolved 33/93 while CodeAct resolved...

> The reason I suggested we take a look at the default agent changes, was just to make sure that it doesn't change its normal behavior. Give or take some...

Hey @mamoodi, thanks for checking in. I’m a bit tied up with other tasks at the moment, so I won’t be able to get back to this right away. Maybe...

I'm thinking about whether we should still make this change in the prompt, as encouraging the agent to use `view` over `ls -R` can save us on tokens, hence allowing...

As far as I can see, the latest commit from `main` on the branch is `01462e11d710327d6a389ce2d34c792e780ca3c4`, and it's after the 0.9.4 release.

I don't think so. Sometimes I got retries after a rate limit error and everything still worked fine. Not sure if it was addressed in some earlier PRs; maybe we can close this...

> 1. It checks the file content via `cat -n`. The output includes line numbers (due to `-n`).
> 2. It calls `str_replace` but it includes the line numbers in...
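To illustrate the mismatch described in the quote, here is a minimal sketch in Python. It assumes the GNU `cat -n` layout (a right-aligned number in a 6-character field, then a tab). The helper `strip_cat_n_prefixes` and the sample file content are hypothetical, made up for illustration, and not part of the project's code.

```python
import re

# Hypothetical helper (not from the project): removes the "     1<TAB>" prefix
# that `cat -n` puts in front of every line.
_CAT_N_PREFIX = re.compile(r"^\s*\d+\t")


def strip_cat_n_prefixes(text: str) -> str:
    """Return `text` with a leading `cat -n` style line number removed from each line."""
    return "\n".join(_CAT_N_PREFIX.sub("", line) for line in text.split("\n"))


# Made-up file content, used only for this example.
file_content = "def add(a, b):\n    return a + b\n"

# Roughly what `cat -n` would print for that file (assumed GNU format).
numbered = "".join(
    f"{i:6d}\t{line}\n" for i, line in enumerate(file_content.splitlines(), 1)
)

# Text copied with line numbers never occurs literally in the file,
# so an exact-match replacement based on it cannot find its target.
print(numbered in file_content)

# After stripping the prefixes, the text matches the file again.
print(strip_cat_n_prefixes(numbered) == file_content)
```

The point is only that text copied from numbered output is not a substring of the file, so any exact-match replacement built from it has nothing to match.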

When running via the UI, I see the hidden count is shown, but looking at the `view` output in the evaluation output for instance `astropy__astropy-14995`, I'm not sure why it's...

@mamoodi, did we run `poetry install` in the evaluation job? When running the eval locally I see the output is updated, while it seems like it's not in the zip file...

@enyst Can you elaborate a bit, maybe with an example? I'm not sure I understand your concern 😅 Here's what the output looks like, which makes sense to me...