raster-vision icon indicating copy to clipboard operation
raster-vision copied to clipboard

More efficient prediction strategy for Object Detection

Open lewfish opened this issue 6 years ago • 1 comments

Currently we use a sliding window with 50% overlap for making predictions in object detection in order to make sure that objects will appear wholly (ie. untruncated) within some window. But maybe it isn't so bad to predict truncated bounding boxes, and we can run 4x faster by using no overlap. And also avoid having to run the post-processing step to remove duplicate predictions.

lewfish avatar May 23 '18 21:05 lewfish

Other ways of speeding this up:

  • Read in whole image to RAM instead of doing windowed reads off of disk?
  • Test if the post-processing step in object detection is a bottleneck (I think it's an n^2 operation)
  • Do chip generation on the CPU in parallel with prediction on the GPU. This is what Keras does.
  • Feed in batches with more than one image at a time.

lewfish avatar Jun 05 '18 22:06 lewfish