enet-keras

COCO labels

Open ahundt opened this issue 7 years ago • 3 comments

Do you represent each label as separate channels in the dataset loader?

I ask because there is a lot of class overlap in COCO and the z order of the masks isn't always correct. For example, the table category often blocks out all the objects on top of the table if you put everything into a single categorical channel rather than a one-hot (multi-hot?) encoding.

ahundt · Jan 28 '18 19:01
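
To make the overlap problem above concrete, here is a minimal NumPy sketch, not code from the repository. The class ids (`TABLE`, `CUP`), the toy 4x4 masks, and the helper `paint_categorical` are all hypothetical; the point is only that a single categorical channel keeps whichever mask is painted last.

```python
import numpy as np

# Hypothetical class ids for illustration only (not real COCO category ids).
BACKGROUND, TABLE, CUP = 0, 1, 2


def paint_categorical(masks, class_ids, shape):
    """Paint boolean masks, in order, into one H x W label map.

    Each pixel can hold a single id, so later masks overwrite earlier ones.
    """
    label_map = np.full(shape, BACKGROUND, dtype=np.int64)
    for mask, class_id in zip(masks, class_ids):
        label_map[mask] = class_id
    return label_map


# Toy scene: the table covers the bottom three rows, the cup sits on the table.
table = np.zeros((4, 4), dtype=bool)
table[1:, :] = True
cup = np.zeros((4, 4), dtype=bool)
cup[1:3, 1:3] = True

# Bad z order: the table is painted last and erases the cup.
wrong = paint_categorical([cup, table], [CUP, TABLE], (4, 4))
# Good z order: the cup is painted on top of the table.
right = paint_categorical([table, cup], [TABLE, CUP], (4, 4))

cup_pixels_wrong = int((wrong == CUP).sum())
cup_pixels_right = int((right == CUP).sum())
print(cup_pixels_wrong, cup_pixels_right)
```

Even with the good ordering, the table pixels under the cup are lost from the label map, which is why a per-class channel layout avoids depending on z order at all.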

You're right about the issue, thanks. However, I've tried a couple of ways to represent the ground truth. One of them is as an NxHxWxC tensor, with each label in a separate channel, like you describe; in that case C is 80 plus an extra channel for background, which makes 81 in total.

It's in here.

PavlosMelissinos · Jan 31 '18 11:01
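
The NxHxWxC layout described in the reply above could look roughly like the sketch below. This is an illustration under assumptions, not the loader the reply links to: the helper `encode_multi_hot` is hypothetical, channel 0 is assumed to be the background channel, and category ids are assumed to have been remapped to contiguous channel indices 1..80 beforehand.

```python
import numpy as np

NUM_OBJECT_CLASSES = 80                # object classes, as stated in the thread
NUM_CHANNELS = NUM_OBJECT_CLASSES + 1  # plus one background channel: 81


def encode_multi_hot(masks, channel_ids, height, width):
    """Hypothetical helper: build an H x W x 81 target from boolean masks.

    masks: iterable of H x W boolean arrays, one per object instance.
    channel_ids: channel index (1..80) for each mask, assumed already remapped.
    """
    target = np.zeros((height, width, NUM_CHANNELS), dtype=np.uint8)
    for mask, channel in zip(masks, channel_ids):
        # Overlapping masks land in different channels, so nothing is erased.
        target[mask, channel] = 1
    # Assumed convention: channel 0 marks pixels not covered by any object.
    target[..., 0] = target[..., 1:].sum(axis=-1) == 0
    return target


# Toy example: a "table" in channel 1 with a "cup" in channel 2 on top of it.
table = np.zeros((4, 4), dtype=bool)
table[1:, :] = True
cup = np.zeros((4, 4), dtype=bool)
cup[1:3, 1:3] = True

sample = encode_multi_hot([table, cup], [1, 2], 4, 4)
batch = np.stack([sample, sample])  # N x H x W x C
print(batch.shape)
```

Because a pixel can be active in several channels at once, this target suits a per-channel sigmoid loss rather than a softmax over classes; which loss the repository uses is not stated in this thread.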

Cool thanks! I just wanted to confirm, sorry I missed those lines.

ahundt · Jan 31 '18 18:01

I wonder, should the data be put in sparse tensors?

I would probably need to test to see if it is a faster or slower approach.

ahundt · Jan 31 '18 18:01
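
As a starting point for the sparse-tensor question above, the sketch below compares the memory footprint of a dense 81-channel target with a coordinate-list (COO-style) form of the same data. It uses plain NumPy rather than any framework's sparse tensor type, the helpers `to_coo` and `from_coo` are hypothetical, and it says nothing about training speed, which would still need to be measured.

```python
import numpy as np


def to_coo(dense):
    """Return one (h, w, c) coordinate row per nonzero entry, plus the shape."""
    coords = np.argwhere(dense).astype(np.int32)
    return coords, dense.shape


def from_coo(coords, shape):
    """Rebuild the dense 0/1 target from its nonzero coordinates."""
    dense = np.zeros(shape, dtype=np.uint8)
    dense[tuple(coords.T)] = 1
    return dense


# Toy 64 x 64 target with 81 channels: background everywhere except one
# 20 x 20 object in (arbitrarily chosen) channel 5.
dense = np.zeros((64, 64, 81), dtype=np.uint8)
dense[..., 0] = 1
dense[10:30, 10:30, 5] = 1
dense[10:30, 10:30, 0] = 0

coords, shape = to_coo(dense)
dense_bytes = dense.nbytes
sparse_bytes = coords.nbytes
print(dense_bytes, sparse_bytes)

restored = from_coo(coords, shape)
```

The saving comes from most of the 81 channels being empty at any given pixel; whether it translates into faster data loading depends on how the framework consumes the sparse form.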