Giovanni Puccetti comments

Results 108 comments of


                                            Giovanni Puccetti

fix coca training

I think coca is in the training tests already, but this does not error. The tokens are not shifted by one, so it learns to copy current token instead of...

fix coca training

> So, been thinking about this one, I really don't like the is_training, it's not done this way elsewhere. The label shift is standard, but why do we need to...

fix coca training

The reason for this is that it was meant to keep it identical to how it was before (assuming I did it right) and since compared to before the tokenizer...

Add gen reg tests

> @gpucce do you have any idea what might be causing it? what's the symptom and by how much is it 'off'? there are numerical changes across versions of pytorch,...

Add gen reg tests

> @gpucce have you run same random inputs through the different towers, save results to verify closeness within some float eps on same env but with current main and previous...

Add gen reg tests

@rwightman I think I found it, could it be the new `open_clip.tokenize` generates sequences with length 76 in some cases?

> @rwightman I think I found it, could it be the new `open_clip.tokenize` generates sequences with length 76 in some cases? specifically `open_clip.get_tokenizer("coca_ViT-B-32")("some text")` has shape `[1, 76]` in the...

Model coca_ViT-B-32 not found; available models = ['RN50', 'RN50-quickgelu', 'RN101', 'RN101-quickgelu', 'RN50x4', 'RN50x16', 'RN50x64', 'ViT-B-32', 'ViT-B-32-quickgelu', 'ViT-B-16', 'ViT-L-14', 'ViT-L-14-336']

Hi, I think there are two issues, the `--pretrained openai` flag makes you load models that were pretrained by openai while coca is not one of those, at the same...

Questions about using COCa to generate captions

Hi, @ykj467422034 can you share a snippet of the code you are actually using? From what I see the one you share is exactly the one in the readme and...

Questions about using COCa to generate captions

> > Hi, @ykj467422034 can you share a snippet of the code you are actually using? From what I see the one you share is exactly the one in the...