Unni Krishnan R Nair

Results 1 comments of Unni Krishnan R Nair

@haotian-liu In my understanding GPT4v slices higher resolution images into 512x512 images plus one context image and then tokenizes + collates those tokens. Have you tried something like this with...