backscrub icon indicating copy to clipboard operation
backscrub copied to clipboard

Process parts of image individually / pad image to model aspect

Open BenBE opened this issue 3 years ago • 3 comments

To avoid clipping the image partway it would be nice if the image may be split into two (or more) overlapping areas that are fed to the NN and recombined after detection (by e.g. ORing the results together). This is overall a bit slower but would allow for arbitrary aspect rations to be handled. This might also allow for feeding a scaled image into the NN and refining the result area by area.

BenBE avatar Jan 21 '21 08:01 BenBE

Hmmm, interesting idea. That might work with deeplab, worth a try. For the Google Meet model, I think it's mostly trained on images showing the full portrait in the center of the frame, so splitting the person across "tile" boundaries might not work at all...

floe avatar Jan 21 '21 08:01 floe

For the Google Meet model padding the image on top/bottom might be worth a try (resizing the image if necessary, even though slower that way).

BenBE avatar Jan 21 '21 13:01 BenBE

Good point, this should be easy to try.

floe avatar Jan 21 '21 18:01 floe