"High Quality Segmentation for Ultra High-resolution Images": I don't see any difference in position information between images

Open trinh-hoang-hiep opened this issue 3 years ago • 1 comments

Hello, thanks for your great work on High Quality Segmentation. I would like to ask the following two questions:

  1. I don't see any difference in the position information P between images. Even within a single image, the variable rel_cell has the same value at every location. So how can CRM generate detailed segmentation masks?
  2. What is the purpose of computing the 3 features in P? They appear to be hard-fixed and simply concatenated onto each feature of the image. Below are the maps that I printed out for these variables.

[two images: printed maps of the position variables]

trinh-hoang-hiep avatar Sep 06 '22 09:09 trinh-hoang-hiep

(1) rel_cell is not always the same: when the input resolution changes while the target resolution stays constant, its value changes. CRM's detailed segmentation masks do not result from rel_cell alone; other design choices also contribute to the final results. (2) Position information being hard-fixed does not make it unsuitable for representation. Both fixed and learnable position information are used in the implicit representation. Thanks.

tcShen avatar Sep 06 '22 09:09 tcShen
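
To make point (1) concrete, here is a minimal sketch of how a rel_cell value could be computed. It assumes the LIIF-style convention for implicit representations (one target pixel spans 2 / target_size in normalized [-1, 1] coordinates, then gets scaled by the feature-map size); this is an assumption for illustration, not code taken from the CRM repository. The function names `rel_cell` and `rel_cell_map` and the example sizes are hypothetical.

```python
import numpy as np


def rel_cell(feat_hw, target_hw):
    """Size of one target pixel measured in feature-map cells.

    Assumes the LIIF-style convention; not taken from the CRM code.
    """
    feat_h, feat_w = feat_hw
    target_h, target_w = target_hw
    # In normalized [-1, 1] coordinates, one target pixel spans 2 / target_size.
    cell = np.array([2.0 / target_h, 2.0 / target_w])
    # Scaling by the feature-map size expresses that span in feature cells.
    return cell * np.array([feat_h, feat_w], dtype=np.float64)


def rel_cell_map(feat_hw, target_hw):
    """Broadcast the single rel_cell value to every query location."""
    target_h, target_w = target_hw
    return np.broadcast_to(rel_cell(feat_hw, target_hw), (target_h, target_w, 2))


# Same target resolution, two different (hypothetical) feature-map sizes.
coarse = rel_cell((32, 32), (512, 512))
fine = rel_cell((64, 64), (512, 512))
print(coarse, fine)

# Within one image the map holds the same value at every location.
cell_map = rel_cell_map((32, 32), (512, 512))
print(np.all(cell_map == cell_map[0, 0]))
```

Under this assumption the map is constant inside one image, which matches the printed maps in the question, but the value differs between passes that use different input resolutions for the same target size. That would let the decoder tell coarse and fine inputs apart even though the value carries no per-pixel detail.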