retinanet.pytorch icon indicating copy to clipboard operation
retinanet.pytorch copied to clipboard

Do you have object detection bounding box visualization results?

Open c0nn3r opened this issue 6 years ago • 8 comments

Trying to debug my implementation's training on COCO, would love to see if you got this repo working on VOC.

c0nn3r avatar Sep 11 '17 20:09 c0nn3r

So, I'm still far from achieving good results on VOC (I had the time to complete only 72 epochs). Some examples (good, promising and bad).

2008_001271 2012_000745 2012_002009

My gut feeling is that something is still wrong in the encoder.

andreaazzini avatar Sep 12 '17 18:09 andreaazzini

@c0nn3r any better results on your side?

andreaazzini avatar Sep 12 '17 19:09 andreaazzini

Nothing yet, having trouble batching the bounding boxes. Does it only work when only one bounding box is needed to detect (or a lower number)? I'm thinking of writing tests for encoder.py and just brute force a working solution. I'm going to switch to getting VOC working.

c0nn3r avatar Sep 12 '17 19:09 c0nn3r

Also @andreaazzini, what are you using to produce the images shown?

c0nn3r avatar Sep 12 '17 19:09 c0nn3r

@c0nn3r https://github.com/andreaazzini/retinanet.pytorch/blob/328f63e12fdcd9e5b11f8828828b302245b13cf8/demo.py I removed demo.py from master because I'm working on an evaluation script, and the demo script needs to be changed accordingly.

andreaazzini avatar Sep 12 '17 19:09 andreaazzini

@andreaazzini I took a look at the loss function and found some problems:

  • I don't think we are normalizing focal loss over the number of background detections / ground-truth boxes (it was italicised in the paper, so it must be important!).
  • The paper claims to have ~100K anchor predictions for the box subnet, that doesn't seem to match our ~47961.

c0nn3r avatar Sep 13 '17 17:09 c0nn3r

@c0nn3r Are you using my encoder or @kuangliu's?

andreaazzini avatar Sep 13 '17 17:09 andreaazzini

@andreaazzini I'm using your encoder

c0nn3r avatar Sep 13 '17 18:09 c0nn3r