CogVLM icon indicating copy to clipboard operation
CogVLM copied to clipboard

Reproduce results on visual7w.

Open sleepyshep opened this issue 6 months ago • 2 comments

System Info / 系統信息

cuda 11.8 torch 2.3.0

Who can help? / 谁可以帮助到您?

No response

Information / 问题信息

  • [ ] The official example scripts / 官方的示例脚本
  • [X] My own modified scripts / 我自己修改的脚本和任务

Reproduction / 复现过程

之前我成功复现了论文中refcoco的结果,但在evaluate visual7w时遇到了问题。 我参考了shikra在evaluate visual7w时的代码,使用如下的prompt来提示cogvlm "Please give a brief and direct reply to 'Which item in the photo is the chair at the desk? Candidates: [433,755,512,966] [003,013,053,895] [002,816,438,996] [180,596,397,996] answer in box format.' with the image"

Expected behavior / 期待表现

请问我的prompt有问题吗,能否提供一份您用来做visual7w任务的prompt,感谢!

sleepyshep avatar Jul 28 '24 01:07 sleepyshep