transformers icon indicating copy to clipboard operation
transformers copied to clipboard

Add post_process_depth_estimation to image processors

Open NielsRogge opened this issue 1 year ago • 1 comments

Feature request

The image processors of depth estimation models could benefit from a post_process_depth_estimation method, similar to the post_process_object_detection, post_process_semantic_segmentation methods.

This includes:

  • [ ] DPT
  • [ ] Depth Anything (and v2)
  • [ ] ZoeDepth (soon: #30136) => this one requires some special postprocessing logic as shown in this notebook

Motivation

People now need to manually interpolate the predicted depth to the original size of the image as seen here.

The interpolation could be replaced by a simple image_processor.post_process_depth_estimation method which postprocesses the predicted depth and interpolates it to the original image size.

Your contribution

I could work on this, but would be great if someone else can take this up.

NielsRogge avatar May 20 '24 14:05 NielsRogge

@NielsRogge, I can take this up!

the-neural-networker avatar May 20 '24 15:05 the-neural-networker

Hi, if no one is working on this issue, I can take this!

ms42402 avatar Jul 16 '24 00:07 ms42402

@ms42402 there's no active PRs, so feel free to tackle this if you'd like!

amyeroberts avatar Jul 16 '24 18:07 amyeroberts

hey anybody working on this? i would like to try and solve this issue

anirudh2781998 avatar Jul 22 '24 16:07 anirudh2781998

@anirudh2781998 As above, if there isn't an active PR open, then anyone is free to work on any issue/feature request

amyeroberts avatar Jul 22 '24 16:07 amyeroberts