web-ui icon indicating copy to clipboard operation
web-ui copied to clipboard

enable manual browsing to solve captchas

Open ari7cr opened this issue 6 months ago • 4 comments

Is there a way to take over the browser opened by web-ui and log-in to a website or solve a captcha, and then let the agent continue the task? I'm failing to access websites with captchas, and would like to just click it myself - or even open chrome, visit the website, log in and solve captcha, and only then start the automation.. Is that possible with web-ui?

ari7cr avatar May 15 '25 11:05 ari7cr

you can pause the agent

vvincent1234 avatar May 15 '25 11:05 vvincent1234

True, thx. Does visiting google gemini work for you via Chrome or Chromium? In both I only get a blank screen, while other websites work normally. Chrome is using the binary file path, and chromium via the patchright install.

ari7cr avatar May 15 '25 15:05 ari7cr

you can pause the agent

and then? Where/how can I access the browser?

(If that sounds like a strange question: I am logging in from another machine on the network. So, all I have is the gradio webui.)

gitwittidbit avatar Jun 03 '25 19:06 gitwittidbit

you can pause the agent

and then? Where/how can I access the browser?

(If that sounds like a strange question: I am logging in from another machine on the network. So, all I have is the gradio webui.)

use vnc

vvincent1234 avatar Jun 14 '25 00:06 vvincent1234