PyTorch-YOLOv3-kitti icon indicating copy to clipboard operation
PyTorch-YOLOv3-kitti copied to clipboard

Format for targets

Open manueldiaz96 opened this issue 4 years ago • 0 comments

Hi!

I just wanted to ask about what is the final format of the targets when you pass them to the network for training?

From your code I understand that you read the GT from a txt file, but on datasets.py in line 96 you use np.readtxt but the labels begin with a string (see this example).

Could you please do me the favor of specifying what is the composition of the targets? I know that per image you limit the amount of boxes to 50, so the targets will have shape [batch_size, 50, 5]

Are they in the form [class, center_x_ratio, center_y_ratio, box_width_ratio, box_height_ratio]?

Where ratio means the value in pixels divided by the width of the image (for center_x and box_width) or by the height of the image (for center_y and box_height).

Thank you!

manueldiaz96 avatar Nov 13 '20 21:11 manueldiaz96