
about cropsize

Open RYHSmmc opened this issue 2 years ago • 1 comment

Hello, I am confused about the crop size. When I run the segmentation demo, I find that BEiT processes images at (512, 512), but in ViT-Adapter the crop size is usually set to (896, 896). Why was this size selected, and is there any association between 512 and 896? Looking forward to your response, thanks!

RYHSmmc • Aug 08 '23 06:08

Crop size 896 was first adopted in the SwinV2 paper. To obtain higher mIoU, we adopted the same setting in some of our models.

SwinV2: https://arxiv.org/pdf/2111.09883.pdf
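
As a rough illustration of how the two crop sizes compare, here is a minimal sketch that counts the patch tokens a ViT-style backbone would see at each size. It assumes a patch size of 16, which is an assumption for this example and is not stated in this thread; `token_grid` is a hypothetical helper, not part of ViT-Adapter or mmsegmentation.

```python
def token_grid(crop_size, patch_size=16):
    """Return (rows, cols, total_tokens) for a ViT-style patch embedding.

    patch_size=16 is an assumption for illustration only.
    """
    height, width = crop_size
    if height % patch_size or width % patch_size:
        raise ValueError("crop size must be divisible by the patch size")
    rows, cols = height // patch_size, width // patch_size
    return rows, cols, rows * cols


if __name__ == "__main__":
    for crop in [(512, 512), (896, 896)]:
        rows, cols, total = token_grid(crop)
        print(f"{crop}: {rows}x{cols} grid, {total} tokens")
```

Under that assumption, both sizes divide evenly into patches, and the 896 crop gives the backbone roughly three times as many tokens as the 512 crop, at a correspondingly higher compute and memory cost.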

czczup • Sep 29 '23 07:09