keras-preprocessing icon indicating copy to clipboard operation
keras-preprocessing copied to clipboard

Bounding box augmentation with Keras ImageDataGenerator

Open ohadlights opened this issue 6 years ago • 4 comments

Link to question asked in stackoverflow.

The issue that I raised is the missing option to adjust labels like bounding boxes and key-points after augmenting an image with ImageDataGenerator.

For example, the applied transformation matrix is not returned to the user.

As far as I know, the status today is that if a user is training a bounding box detector, he can't use the ImageDataGenerator.

There are several ways to deal with this issue. For example, the ImageDataGenerator can add a callback that will allow to update the labels. Another option is to return the transformation matrix. Or probably best, the ImageDataGenerator can take a list of points and adjust them according to the transformation that is applied on each image.

ohadlights avatar Feb 18 '19 11:02 ohadlights

is there now a solution to this problem/question? I am also interested in this topic..

Alex1101a avatar Dec 02 '19 14:12 Alex1101a

This is interesting to me as well, I think for now we have to roll our own image augmentation that computes the resulting bounding box ?

mtourne avatar Mar 11 '20 19:03 mtourne

Any news on the topic?

rabitwhte avatar Apr 20 '20 07:04 rabitwhte

Following this RFC, I think we should make Keras preprocessing layers for this task. (Inside TF Repo)

Dref360 avatar Apr 20 '20 12:04 Dref360