search-agents
search-agents copied to clipboard
Value function for Text WebArena is multi-modal
Thanks for sharing the interesting work!
I notice that the script to run the text WebArena evaluation utilizes the value function here https://github.com/kohjingyu/search-agents/blob/c14a5d1bf2bea07f7aaa761d460c9d44e953d95f/agent/value_function.py#L16 which is by default multi-modal.
I think this would provide an unintended edge compared to baseline models which are only prompt-based?