Yuxuan Wang

Results 1 issues of Yuxuan Wang

I want to use a pretrained vision transformer from clip to extract feature from images. My original image size is 10241024. What is the largest input image size for any...