[Feature] 我希望能输出一张图片中两个打架的人的边界框,我该怎么设计prompt?
Motivation
1、给的prompt是
2、给的prompt是
3、给的prompt是
4、给的prompt是Please detect all fighters in the following image and mark their positions,模型会检测图片中所有的objects及其位置。
难道只能用多轮对话实现我的需求吗?
Related resources
No response
Additional context
No response
To achieve the desired output format, consider specifying it explicitly in the prompt or exploring the use of a larger language model with enhanced capabilities.
To achieve the desired output format, consider specifying it explicitly in the prompt or exploring the use of a larger language model with enhanced capabilities.
Thanks for your advice,I'm gonna start with changing the prompt. The prompt3 "the person who is fighting on the left", is it not specific enough? Maybe I can describe the person's clothes to locate him, but my demand is to locate the fighters no matter what he was wearing.