AppAgent
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
All interactive elements on the screen are labeled with red and blue numeric tags. Elements labeled with red tags are clickable elements; elements labeled with blue tags are scrollable elements....
Hi team, I am impressed by your research. Thank you for providing such a meaningful direction for AI. While reading through your research, I was confused about the XML you mentioned...
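Hello authors, in Figure 3 of the arXiv paper, one of the numeric labels in the figure does not match the text. Just letting you know.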
I log in with a valid ChatGPT appKey, but once the request is sent it fails with the following error: Round 1 Thinking about what to do in the next step... Traceback (most recent call last): File "/Users/liubochen/AppAgent/e/lib/python3.10/site-packages/urllib3/connectionpool.py", line 776, in urlopen self._prepare_proxy(conn) File "/Users/liubochen/AppAgent/e/lib/python3.10/site-packages/urllib3/connectionpool.py", line 1045,...
 Is there any way to make it more accurate?
Changed string concatenation to f-strings to improve readability and match the rest of the code.
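As a minimal sketch of the kind of change this describes, the snippet below builds the same message once with `+` concatenation and once with an f-string. The function names and the message text are hypothetical and are not taken from the AppAgent code base.

```python
def describe_round_concat(round_count: int, action: str) -> str:
    # Concatenation style: every non-string value needs an explicit str() call.
    return "Round " + str(round_count) + ": " + action


def describe_round_fstring(round_count: int, action: str) -> str:
    # f-string style: values are interpolated in place, no manual conversion.
    return f"Round {round_count}: {action}"


if __name__ == "__main__":
    print(describe_round_concat(1, "tap"))
    print(describe_round_fstring(1, "tap"))
```

Both functions return identical strings; the f-string version simply keeps the template and the values together on one readable line.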
Details: ------------------------------------- Translated Report (Full Report Below) ------------------------------------- Process: Python [43069] Path: /usr/local/Cellar/python@3.10/3.10.13/Frameworks/Python.framework/Versions/3.10/Resources/Python.app/Contents/MacOS/Python Identifier: org.python.python Version: 3.10.13 (3.10.13) Code Type: X86-64 (Translated) Parent Process: zsh [42743] Responsible: iTerm2 [42738]...
This document is generated using: https://github.com/james4ever0/prometheous You can view the website at: https://james4ever0.github.io/AppAgent/