Ray Yarman

Results 2 issues of Ray Yarman

I only have a 16GB graphics card, so I used the CPU to run it,My code is like: **** import torch from PIL import Image from lavis.models import load_model_and_preprocess device...

I checked the predict code and paper. It seems you added the quantized image tokens to pretrained language tokenizer. In other papers, Some people separate the tokenizer of language and...