Matt

203 comments by Matt

@sgugger Absolutely agree on the potential for small bugs, but in the case of model porting, doesn't the CI test equivalence with the PT original? If the model accepts various...
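
For reference, the equivalence check amounts to roughly this kind of comparison: run identical inputs through the PyTorch and TensorFlow versions and assert the outputs agree within a tolerance. This is only a sketch; the checkpoint and tolerance below are illustrative, not the actual test code.

```python
import numpy as np
import tensorflow as tf
import torch
from transformers import AutoTokenizer, BertModel, TFBertModel

checkpoint = "bert-base-uncased"  # example checkpoint with both PT and TF weights
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
pt_model = BertModel.from_pretrained(checkpoint)
tf_model = TFBertModel.from_pretrained(checkpoint)

inputs = tokenizer("Checking PT/TF equivalence", return_tensors="np")

# Run the same tokenized inputs through both frameworks
with torch.no_grad():
    pt_out = pt_model(**{k: torch.tensor(v) for k, v in inputs.items()}).last_hidden_state.numpy()
tf_out = tf_model(**{k: tf.constant(v) for k, v in inputs.items()}).last_hidden_state.numpy()

# Small numerical differences between frameworks are expected, hence the tolerance
assert np.max(np.abs(pt_out - tf_out)) < 1e-4
```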

I'm still working on the prompt!!!!!

Interesting! Can you document exactly what errors you got with the compile step and what code you ran to cause them?

Yes please! Ideally if you could give us some minimal code that reproduces the issue, that would make it much easier for us to track it down. Also, sorry for...

Ah, I see! The issue here is caused by some specific behaviour of the SegFormer models when using inputs of this resolution. The model outputs are actually at a lower...
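
To illustrate that point: SegFormer logits come out at a quarter of the input height/width, so they usually need to be upsampled back to the original resolution before you compare them against the input. A rough PyTorch sketch, using an example checkpoint and an image you supply yourself:

```python
import torch
import torch.nn.functional as F
from PIL import Image
from transformers import SegformerForSemanticSegmentation, SegformerImageProcessor

checkpoint = "nvidia/segformer-b0-finetuned-ade-512-512"  # example checkpoint
processor = SegformerImageProcessor.from_pretrained(checkpoint)
model = SegformerForSemanticSegmentation.from_pretrained(checkpoint)

image = Image.open("your_image.png").convert("RGB")  # any image you want to segment
inputs = processor(images=image, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits  # shape: (batch, num_labels, height / 4, width / 4)

# Upsample back to the original resolution before taking the per-pixel argmax
upsampled = F.interpolate(logits, size=image.size[::-1], mode="bilinear", align_corners=False)
segmentation = upsampled.argmax(dim=1)
```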

Yeah, refactoring our TF models to make `summary()` more usable is absolutely on the list! Unfortunately it's quite a big list, but it's definitely there.
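
For anyone wanting to see the current behaviour: the TF models are Keras models, so `summary()` already works once the model is built (which `from_pretrained` does for you), but today it mostly reports a single "MainLayer" rather than a per-layer breakdown, which is the usability issue referred to above. The checkpoint here is just an example.

```python
from transformers import TFBertModel

model = TFBertModel.from_pretrained("bert-base-uncased")  # example checkpoint
model.summary()  # currently prints one big MainLayer rather than individual layers
```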

The TF port is mostly complete now and tests are passing locally - I just need to go around updating docs and auto classes and so on. The main code...

Got through a lot of the comments today, but I have a couple of other things to do - will try to finish them tomorrow!

The last remaining big issue is that some of the pt-tf equivalence tests fail when weights don't match up between models. This is caused by the cross-attention weights not being...
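
A quick way to spot weights that exist in one framework's model but not the other's is to compare the weight names on each side. This is only a rough diagnostic sketch with an example checkpoint, not the actual test logic:

```python
from transformers import BertModel, TFBertModel

checkpoint = "bert-base-uncased"  # example; in practice you'd do this per ported model
pt_names = {name for name, _ in BertModel.from_pretrained(checkpoint).named_parameters()}
tf_names = {weight.name for weight in TFBertModel.from_pretrained(checkpoint).weights}

print(f"{len(pt_names)} PyTorch parameters vs. {len(tf_names)} TF weights")
# The naming conventions differ (dots vs. slashes), so a real comparison maps both
# sides through the same renaming logic used when cross-loading checkpoints; a gap
# in the counts above is usually the first hint that a layer (e.g. a cross-attention
# block) was created in one framework's model but not the other's.
```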

The issue seems to be that in all of our other models, cross-attention layers are only added when `config.add_cross_attention` is True, but in the case of BLIP it only checks...
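
For comparison, the convention in the other models looks roughly like this. This is a simplified sketch of the pattern, not the actual BLIP or BERT code:

```python
import torch.nn as nn


class ExampleDecoderLayer(nn.Module):
    """Simplified sketch of the usual gating pattern, not real library code."""

    def __init__(self, config):
        super().__init__()
        self.attention = nn.MultiheadAttention(config.hidden_size, config.num_attention_heads)
        # Cross-attention weights are only created when the config explicitly asks for
        # them, so models that never use cross-attention don't carry (or expect to
        # load) those parameters.
        if getattr(config, "add_cross_attention", False):
            self.crossattention = nn.MultiheadAttention(config.hidden_size, config.num_attention_heads)
```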