CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image.
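For context, here is a minimal zero-shot prediction sketch along the lines of the repository's README; the `"CLIP.png"` path is a placeholder for any local image:

```python
import torch
import clip
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

# Encode one image and a few candidate captions
image = preprocess(Image.open("CLIP.png")).unsqueeze(0).to(device)
text = clip.tokenize(["a diagram", "a dog", "a cat"]).to(device)

with torch.no_grad():
    # logits_per_image has shape (1, num_texts); softmax gives caption probabilities
    logits_per_image, _ = model(image, text)
    probs = logits_per_image.softmax(dim=-1).cpu().numpy()

print("Label probs:", probs)  # the highest-probability caption is the prediction
```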
Not really an issue; I just want to share my training code, since some people still have difficulty writing it. Just modify the code to suit...
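As a reference point, a hedged sketch of what such a training step typically looks like: the symmetric contrastive loss from the paper applied through the public `clip` API. The `dataloader` yielding `(images, texts)` batches and all hyperparameters are illustrative assumptions, not the issue author's actual code:

```python
import torch
import torch.nn.functional as F
import clip

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device, jit=False)
model.float()  # the released weights are fp16 on CUDA; fp32 is simpler to optimize
optimizer = torch.optim.Adam(model.parameters(), lr=1e-6)  # illustrative LR

for images, texts in dataloader:  # hypothetical DataLoader of (tensor batch, list of captions)
    images = images.to(device)
    tokens = clip.tokenize(texts).to(device)

    # Scaled cosine-similarity matrices between all images and all texts in the batch
    logits_per_image, logits_per_text = model(images, tokens)
    labels = torch.arange(len(images), device=device)  # matching pairs lie on the diagonal

    # Symmetric cross-entropy over both directions, as in the paper
    loss = (F.cross_entropy(logits_per_image, labels)
            + F.cross_entropy(logits_per_text, labels)) / 2

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```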
`Swin Transformer` achieves higher accuracy than `ViT` at a similar model size and computational cost. I think that using CLIP's method and dataset would show even higher performance. - ViT-B/16, 384x384, 86M,...
In [this paper](https://arxiv.org/pdf/2103.00020.pdf) there is only a vague description of the WIT dataset: > ...we constructed a new dataset of 400 million (image, text) pairs collected from a...
Can I use a different method to tokenize the input prompt and still get a proper prediction, or must I use the `clip.tokenize(str)` method? I'm wondering if I can, for...
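For reference, `clip.tokenize` applies the model's own BPE vocabulary and pads to the fixed 77-token context; since the pretrained text encoder's embedding table is tied to that vocabulary, a different tokenizer will generally not yield proper predictions without retraining. A quick check of its output:

```python
import clip

# clip.tokenize lowercases, applies the model's BPE vocabulary, adds
# start/end tokens, and pads to the fixed 77-token context length
tokens = clip.tokenize(["a photo of a cat"])
print(tokens.shape)  # torch.Size([1, 77])
```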
Hi, Thanks for providing this really convenient package for using the CLIP model! I've come across a problem with `build_model` when trying to reconstruct the model from a state_dict on...
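For context, `build_model` in `clip/model.py` infers the architecture hyperparameters from the keys and shapes of the `state_dict` itself. A hedged sketch of the intended usage; `"clip_weights.pt"` is a placeholder for a checkpoint saved earlier:

```python
import torch
from clip.model import build_model

# "clip_weights.pt" is a placeholder for a checkpoint saved via model.state_dict()
state_dict = torch.load("clip_weights.pt", map_location="cpu")
model = build_model(state_dict).eval()
```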
Hi, I want to train the model with my own dataset, but I have a small question; would you please help me? How do I restart training from a checkpoint? How...
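A generic PyTorch checkpointing pattern answers the resume question; this is a minimal sketch, not CLIP-specific, and the file name is illustrative:

```python
import torch

def save_checkpoint(model, optimizer, epoch, path="checkpoint.pt"):
    # Store everything needed to resume: weights, optimizer state, and progress
    torch.save({
        "epoch": epoch,
        "model_state_dict": model.state_dict(),
        "optimizer_state_dict": optimizer.state_dict(),
    }, path)

def load_checkpoint(model, optimizer, path="checkpoint.pt"):
    ckpt = torch.load(path, map_location="cpu")
    model.load_state_dict(ckpt["model_state_dict"])
    optimizer.load_state_dict(ckpt["optimizer_state_dict"])
    return ckpt["epoch"] + 1  # epoch to resume from
```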
Thank you for your amazing paper. I am trying to evaluate CLIP with RN50x16 on ImageNet using `output = model.encode_image(test_image)`, but get an error: `File "<stdin>", line 1, in <module>` at `output = model.encode_image(test_image)`...
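A common cause of such errors is passing a raw PIL image or an unbatched tensor to `encode_image`; it expects the preprocessed, batched tensor produced by the `preprocess` transform that `clip.load` returns. A hedged sketch, with `"test.jpg"` as a placeholder path:

```python
import torch
import clip
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("RN50x16", device=device)

# preprocess resizes/normalizes to the model's input resolution;
# unsqueeze(0) adds the batch dimension encode_image requires
image = preprocess(Image.open("test.jpg")).unsqueeze(0).to(device)
with torch.no_grad():
    features = model.encode_image(image)
print(features.shape)  # (1, embedding_dim); the dim depends on the model variant
```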
Hi, I have trained a CLIP model using images and their captions. Now I want to evaluate the performance of the model with metrics like precision, recall, and F1 score. How can I...
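Since each image is paired with exactly one caption, retrieval-style metrics are the usual fit; at K = 1, recall equals precision equals top-1 accuracy. A minimal sketch, assuming the embeddings were precomputed with `encode_image`/`encode_text` and L2-normalized:

```python
import torch

def recall_at_k(image_feats, text_feats, k=1):
    # image_feats, text_feats: (N, D) L2-normalized embeddings, where row i
    # of each matrix belongs to the same (image, caption) pair
    sims = image_feats @ text_feats.t()          # (N, N) cosine similarities
    topk = sims.topk(k, dim=-1).indices          # top-k retrieved captions per image
    targets = torch.arange(len(sims)).unsqueeze(1)
    return (topk == targets).any(dim=-1).float().mean().item()
```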
Thank you for your amazing paper. I am trying to evaluate CLIP with a linear probe on ImageNet, but wish to save some of the compute needed for the sweep required...
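For reference, the linear probe in the paper is ordinary logistic regression on frozen image features, with a sweep over the L2 regularization strength. A minimal sketch assuming the features and labels are precomputed; `C=0.316` is the value used in the repository's linear-probe example:

```python
from sklearn.linear_model import LogisticRegression

def linear_probe_accuracy(train_feats, train_labels, test_feats, test_labels, C=0.316):
    # Fit a logistic-regression classifier on frozen CLIP image features;
    # the paper sweeps C (inverse L2 strength) rather than fixing it
    clf = LogisticRegression(random_state=0, C=C, max_iter=1000)
    clf.fit(train_feats, train_labels)
    return clf.score(test_feats, test_labels)  # top-1 accuracy
```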
Hi, Thanks for the great work. Due to the needs of a specific task, I want to train CLIP from scratch without using BPE encoding or the 77-token length limit,...
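This is feasible because `context_length` and `vocab_size` are ordinary constructor arguments of the `CLIP` class in `clip/model.py`; the text encoder only consumes token ids, so any tokenizer that maps text into `[0, vocab_size)` can replace BPE when training from scratch. A hedged sketch with purely illustrative values:

```python
from clip.model import CLIP

# All values below are illustrative (roughly ViT-B/32-shaped), not a released config
model = CLIP(
    embed_dim=512,
    image_resolution=224,
    vision_layers=12,
    vision_width=768,
    vision_patch_size=32,
    context_length=256,   # replaces the 77-token limit
    vocab_size=30000,     # size of the custom tokenizer's vocabulary
    transformer_width=512,
    transformer_heads=8,
    transformer_layers=12,
)
```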