OpenHands icon indicating copy to clipboard operation
OpenHands copied to clipboard

Does the browser work?

Open shaggy2626 opened this issue 1 year ago • 5 comments

How do you use the browser? I'm unable to edit the url or get the agent to open a browser especially when it runs a flask app

image

shaggy2626 avatar May 23 '24 04:05 shaggy2626

Which agent are you using? If you are using CodeActAgent (and gpt-4o as the LLM) from main, you can actually tell the agent to browse the URL of OpenDevin and it should work.

Unfortunately, the browser is currently edit-only for the agent (e.g., if humans edit it, it may confuse the agent) - but this may change as we improve the agents.

xingyaoww avatar May 23 '24 10:05 xingyaoww

can we edit url so that we can guide agent to look for specific information from that url ??

rishi8011 avatar May 23 '24 20:05 rishi8011

When I ask the agent to browse for a URL it responds with "I don't have the capability to interact with a web browser directly". I am using the CodeActAgent and gpt-4o

cwints avatar May 23 '24 21:05 cwints

same issue with me

rishi8011 avatar May 23 '24 21:05 rishi8011

@xingyaoww check this github project how it integrates browser in it maybe it help us to improve opendevin

https://github.com/entropy-research/Devon

rishi8011 avatar May 23 '24 21:05 rishi8011

Worked for me running from the latest build.

waldy6 avatar May 25 '24 10:05 waldy6

Not every agent has the ability to open browser, if it doesn't work, pls tell us what's the agent you're using.

assertion avatar May 31 '24 10:05 assertion

As mentioned, not every agent has browser capability. However, if you choose CodeActAgent and type something like: "Go to the OpenDevin github page and tell me how many stars the github repo currently has" you can see the browser in action. Currently you cannot edit the browser url.

mamoodi avatar Jun 08 '24 18:06 mamoodi