AppAgent issues

Can the recorded actions e.g. reference document also be shared?

Hi, thanks for sharing. I am wondering whether you could also share the reference document since it's expensive to use the GPT4V API to construct such a document. In such...

SiyuanHuang95

ask_gpt4v in model.py needs to be refactored

We should merge the JSON handling code (the `content` assembly code) from transaction layer to the base model layer, and `ask_gpt4v` method signature would need to be changed to `text,...

truebit

Exploration of Potential Enhancements to Autonomous Exploration Algorithms in AppAgent

1

Dear Contributors, I hope this message finds you well. I have been thoroughly engaged with the AppAgent framework and am particularly intrigued by the autonomous exploration capabilities that have been...

yihong1120

Comparison with AutoDroid (a similar tool)?

1

Hello, good work! I'm one of the authors of AutoDroid, an LLM-based Android task automation approach released several months ago before AppAgent. We did not advertise our work, so it...

yuanchun-li

ABD Command execution failed: adb -s 000002f62f317e4e shell screencap -p "Phone Storage/sdcard/Pictures/Screenshots"/1_before.png

6

PS D:\PythonWorkSpace\AppAgent> python learn.py Warning! No module named 'sounddevice' Warning! No module named 'keras' Welcome to the exploration phase of AppAgent! The exploration phase aims at generating documentations for UI...

CharmanY1n

Code Pauses at cv2.waitKey(0): Waiting for Keyboard Input

2

When my code reaches the line cv2.waitKey(0), how am I supposed to input? I have pressed various keys on my computer keyboard and also performed corresponding actions on my phone,...

doyer3112

Fabulous work! What about Captcha?

2

Well, just curious, have you guys come across any captcha on the Android phone. Can the Agent manage to solve it?

zwsjink

[macOS] while running 1. autonomous exploration

1

Please enter the description of the task you want me to complete in a few sentences: Task: Search for the user Bill Gates and follow him Round 1 Thinking about...

HarshaSatyavardhan

[macOS] suggestions for `get_screenshot` and `get_xml`, and issue with `traverse_tree`

2

Hi team, I am impressed by the features offered by your work. Congratulations. After trying it on macOS 14.0 from a MacBook M2, I encountered two issues: 1. I was...

danielfebrero

while running 2. human demonstration

When I running human demonstration phase, I could only see half of the labeled screenshots, as shown in the picture below, and this window could not be changed in size...

sesuii

AppAgent
AppAgent copied to clipboard

Metadata

Can the recorded actions e.g. reference document also be shared?

ask_gpt4v in model.py needs to be refactored

Exploration of Potential Enhancements to Autonomous Exploration Algorithms in AppAgent

Comparison with AutoDroid (a similar tool)?

ABD Command execution failed: adb -s 000002f62f317e4e shell screencap -p "Phone Storage/sdcard/Pictures/Screenshots"/1_before.png

Code Pauses at cv2.waitKey(0): Waiting for Keyboard Input

Fabulous work! What about Captcha?

[macOS] while running 1. autonomous exploration

[macOS] suggestions for `get_screenshot` and `get_xml`, and issue with `traverse_tree`

while running 2. human demonstration

← Metadata

Owner

Metadata

AppAgent AppAgent copied to clipboard

Metadata

← Metadata

Owner

Metadata

AppAgent
AppAgent copied to clipboard