visualwebarena icon indicating copy to clipboard operation
visualwebarena copied to clipboard

VisualWebArena is a benchmark for multimodal agents.

Results 20 visualwebarena issues
Sort by recently updated
recently updated
newest added

I found some errors in annotion. In the classifieds_10: sites: ['classifieds'] task_id: 10 require_login: True storage_state: ./.auth/classifieds_state.json start_url: http://localhost:9980 geolocation: None intent_template: What is the {{attribute}} of {{item}}? intent: What...

Hello, Could you please share some of the configuration settings to reproduce the various model types? I tried to reproduce the caption-augmented setup (Acc Tre + Caps) but my value...

Thank you for your work. I can come to the the homepage with http://127.0.0.1:4399 but not the www.homepage.com. Do I have to change the host of my server ot navigate...

Hello, I'm looking to reproduce some of the open-source model results from the VWA paper: (1) Mixtral-8x7B model as the LLM backbone for Caption-augmented model (2) CogVLM for the Multimodal...

Hi There, I'm running into test failures when I run the pytest test suite. Here is my error: ``` tests/test_browser_env/test_script_browser_env.py s.s.......F ============================================================ FAILURES ============================================================ ____________________________________________________ test_click_open_new_tab _____________________________________________________ accessibility_tree_current_viewport_script_browser_env = def...

hi @kohjingyu please review if the implementation to support switching between datasets

Hi, I tried your demo agent on my web application and for most interactions it does well. However it seems to have trouble with identifying/clicking on checkboxes. Is there some...

Hi, I am looking at the scripts `run_reddit_som.sh, run_shopping_som.sh, run_classifieds_som.sh`. IIUC, they all involve creating batches of indices and the docker gets reset between each of these batches. https://github.com/web-arena-x/visualwebarena/blob/b56b6d821e0b0f926fb940a7efe7d3f1246eab36/scripts/run_reddit_som.sh#L21 However,...

Hello, when I try to run the models with accessibility_tree_with_captioner. I found that sort can not be found in the accessibility_tree_with_captioner. There is only a staticText of sort but no...

added upload to the action space, a page focus safeguard, local hosting of images to combat broken url bugs/longetivity of the benchmark, and new config files to adjust for this