叶加博

Results 25 comments of 叶加博

Yes, the images are resized to squares for example 448x448. But, the generated coordinates should be a value in the range [0,1], which are ratios that are unrelated to the...

> > For the same snippet I got the following error: > > ``` > > --------------------------------------------------------------------------- > > RuntimeError Traceback (most recent call last) > > Cell In[8], line...

Owl series support multiple images inputs. You can develop the downstream pipeline by passing a list of images and place the same number of "" in your prompt.

You can pass a list of images and place the same number of "" in your prompt.

> I pass a list of images, say 2 images, and modify the prompt. The image_tensor after preprocess has batch size of 2, while the input_ids has batch size of...

> Hi, I tried a quick implementation to compute the output likelihoods of a given interleaved image-text token sequence: > > ```python > def get_class_log_likelihoods(image_path, classes, model, tokenizer, img_processor, device='cuda',...

We updated the repository that can support instruction-tuning based on peft. And the peft requires pytorch>=1.13.1. You can use ```conda install pytorch==1.13.1 torchvision==0.14.1 torchaudio==0.13.1 pytorch-cuda=11.7 -c pytorch -c nvidia``` to...

> 我和他遇到了一样的问题,在你们的数据集上,loss不为nan,在自己的数据集上,loss为nan There is a high possibility that the issue is caused by the prompt being too long and the part complement being cut off during preprocessing. As a result,...

I just fixed the link, try again ^^

Are you using the zero-3 strategy to initialize the model? If so, the parameters may be offloaded.