camel icon indicating copy to clipboard operation
camel copied to clipboard

[Question] Web Looping Behavior in Owl + Qwen Integration

Open x-ray990011 opened this issue 8 months ago • 9 comments

Required prerequisites

Questions

Using Owl in conjunction with qwen max and qwen vl max to perform intelligent operations, the agent can only open web pages and perform the first search box retrieval. The newly opened webpage cannot be recognized, and the intelligent experience falls into a self dead loop, constantly opening new webpages, but the content of the new webpages is exactly the same.

x-ray990011 avatar Apr 08 '25 04:04 x-ray990011

Image The execution statement is as follows: Open https://www.runoob.com/ I found the program for viewing team leader elements in Python 3 inside. I have already parsed the webpage and found the. placeholder in the search box of this webpage. You can use this search box to try to find the content I want and save the searched content locally. Do not use other search software, only search within this website. After searching, check the results for each

tag. result show the picture.

x-ray990011 avatar Apr 08 '25 04:04 x-ray990011

The intelligent experience keeps opening new web pages, as shown in the picture

Image

x-ray990011 avatar Apr 08 '25 04:04 x-ray990011

@x-ray990011 Can you upload the original log output?

fengju0213 avatar Apr 08 '25 10:04 fengju0213

@x-ray990011 This may also be related to the model's capabilities, or there may be some problems with the set of marks.

fengju0213 avatar Apr 08 '25 10:04 fengju0213

gradio_log_2025-04-07.txt @fengju0213 This is the log file

x-ray990011 avatar Apr 08 '25 14:04 x-ray990011

gradio_log_2025-04-07.txt @fengju0213 This is the log file

@x-ray990011 Thank you for sharing. I've gone through the entire log and noticed that the model doesn't seem to fully understand our task. Perhaps we can modify the original prompt, for example:"搜索python3查看队首元素,总结并告诉我相关的细节内容"

fengju0213 avatar Apr 08 '25 14:04 fengju0213

gradio_log_2025-04-07.txt @fengju0213 This is the log file

btw,which version of OWL are you using?

fengju0213 avatar Apr 08 '25 14:04 fengju0213

gradio_log_2025-04-07.txt @fengju0213 This is the log file

btw,which version of OWL are you using?

the latest version

x-ray990011 avatar Apr 08 '25 15:04 x-ray990011

gradio_log_2025-04-07.txt @fengju0213 This is the log file

@x-ray990011 Thank you for sharing. I've gone through the entire log and noticed that the model doesn't seem to fully understand our task. Perhaps we can modify the original prompt, for example:"搜索python3查看队首元素,总结并告诉我相关的细节内容"

This modification method may not necessarily be clearer than my task scenario description. Mainly due to the lack of page recognition and next steps for newly opened pages, OWL seems to have fallen into its own dead loop.

x-ray990011 avatar Apr 08 '25 15:04 x-ray990011