Entity
Entity copied to clipboard
"High Quality Segmentation for Ultra High-resolution Images" don't see the difference between image
hello, thanks for your great work High Quality Segmentation. I want to ask the following 2 questions
- I don't see any difference in P position information between images. Even in an image, the information of the variable rel_cell is the same in all locations. So why can CRM generate detailed segmentation masks?
- What is the meaning of calculating 3 features in P because it seems to me that it is hard-fixed and concatenated into each feature of the image? below is the map that I printed out for the variables

(1) For rel_cell, when the input resolution changes (the target resolution is constant), it is not same. That CRM generates detailed segmentation masks not only results from rel_cell. Other designs also contribute to the final results. (2) The hard-fixed position information doesn't mean it's unsuitable for representation. Fixed and learnable position information is both used in the implicit representation. Thanks.