Giovanni Puccetti comments

Results 100 comments of


                                            Giovanni Puccetti

use `TextEncoder` in coca `encode_image`

@iejMac my idea about the issue seems like it's wrong, another thing could be that since pad tokens are now ignored in the loss it looks higher. Tomorrow I will...

use `TextEncoder` in coca `encode_image`

@iejMac sorry I didn't have time to work on this sooner and also for wasting some compute: - higher loss is due to now ignoring pad_tokens and indeed performance is...

add generate w/ beam search

@Soonhwan-Kwon i am working on adding coco as a dataset so we can make evaluation automatically, should make the PR on a couple days unless you are doing it already

CoCa: fix MultimodalTransformer init + Mask CLS token at end of seq

No I think it used the default ones, I think the VisionTransformer doesn't call it either? I mean it calls it but it does nothing

CoCa: fix MultimodalTransformer init + Mask CLS token at end of seq

@iejMac I added one more change that should make this ready for the temptative retraining

CoCa: fix MultimodalTransformer init + Mask CLS token at end of seq

@rwightman @rom1504 @iejMac hi, I worked on this PR, as it is it has a few changes in tests, adds transformers compat and fixes the issues. This is the best...

CoCa: fix MultimodalTransformer init + Mask CLS token at end of seq

> @gpucce so discussing here so I might possibly combine this with #660 checks, this was days before my second child was born so yeah, it got lost in the...

build_cls_mask() in CoCa TextTransfotmer

Hi, There is a PR #551 to fix this but I think nobody has time to review it

Coca training related question

@JaejinCho hi do not worry about tagging :) in general I think I used the model with larger batch sizes in smaller gpus. Looking at the error it looks like...

Coca training related question

@JaejinCho @tillaczel I am using 1.13.1 but it might not be the issue. Are you trying to fine-tune a pre-trained model or pretrain a new one?