MiniGPT-4 icon indicating copy to clipboard operation
MiniGPT-4 copied to clipboard

MULTIPLE OBJECT in the images are not getting detected

Open sumanttyagi opened this issue 1 year ago • 3 comments
trafficstars

what i have observed while running the pre trained model making detection . I was not able to make multiple detection say i want to detect a car but it will only detect one car not all the cars ?

  • [ ] a) is this expected behavior one box per class ?
  • [ ] b) how can i decode to get multiple bounding box for the same labels ?

sumanttyagi avatar Dec 04 '23 08:12 sumanttyagi

if you use the [detection] identifier, it is expected to extract all the bboxs for the car label.

which pretrained checkpoint are you loading? is it "MiniGPT-v2 (online developing demo)"? This checkpoint should give you more expected output.

junchen14 avatar Dec 05 '23 18:12 junchen14

Yes i am using MiniGPT-v2 (online developing demo).

while doing this i observed that in VQA it can answer that there are 3 cars in image, but while making detection it is only showing 1 . I wonder how it can predict 3 in vqa but not able to detect all the cars .

sumanttyagi avatar Dec 05 '23 20:12 sumanttyagi

Sometimes the model is sensitive to the singular and plural forms of inputs, such as 'car' and 'cars'. Maybe you can also explore different prompts.

junchen14 avatar Dec 06 '23 00:12 junchen14