Unni Krishnan R Nair
@haotian-liu As I understand it, GPT-4V slices higher-resolution images into 512x512 tiles plus one context image, then tokenizes each and collates the resulting tokens. Have you tried something like this with...
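
To make the slicing idea concrete, here is a minimal sketch of the tiling step as described in the comment. It is not a confirmed account of how GPT-4V works. The helper names `tile_boxes` and `context_size` are hypothetical, and clipping edge tiles and downscaling the full image to fit one tile are assumptions made for illustration.

```python
from math import ceil


def tile_boxes(width, height, tile=512):
    """Return (left, top, right, bottom) crop boxes covering the image in a grid.

    Hypothetical helper: the 512 default follows the comment above.
    """
    cols = ceil(width / tile)
    rows = ceil(height / tile)
    boxes = []
    for r in range(rows):
        for c in range(cols):
            left, top = c * tile, r * tile
            # Assumption: edge tiles are clipped to the image bounds;
            # padding them to a full tile is left to the caller.
            boxes.append((left, top, min(left + tile, width), min(top + tile, height)))
    return boxes


def context_size(width, height, tile=512):
    """Size of a downscaled full-image 'context' view that fits inside one tile."""
    # Assumption: aspect ratio is preserved and small images are never upscaled.
    scale = min(tile / width, tile / height, 1.0)
    return max(1, round(width * scale)), max(1, round(height * scale))


if __name__ == "__main__":
    boxes = tile_boxes(1024, 768)
    print(len(boxes), "tiles; context view size:", context_size(1024, 768))
```

Each box could then be cropped from the image and encoded separately, with the context view encoded once and its tokens concatenated alongside the tile tokens.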