M. Yusuf Sarıgöz

97 comments by M. Yusuf Sarıgöz

Thanks @JustinLin610! Yes, let me clarify it:
- The HF native generator works well with all the HF-compatible variants.
- The original Fairseq decoder works well with all the Fairseq-compatible variants.
- ...
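
For reference, here's a minimal sketch of what "HF native generator" usage looks like with an HF-compatible OFA checkpoint. It assumes the OFA-Sys fork of transformers that ships `OFATokenizer` and `OFAModel`; the checkpoint path, image path, and generation settings are placeholders, and the exact kwargs may differ between versions of the fork.

```python
from PIL import Image
from torchvision import transforms
from transformers import OFATokenizer, OFAModel  # provided by the OFA-Sys fork of transformers

# Preprocessing along the lines of the official examples; the expected
# resolution may differ per checkpoint size.
resolution = 480
patch_resize_transform = transforms.Compose([
    lambda image: image.convert("RGB"),
    transforms.Resize((resolution, resolution), interpolation=transforms.InterpolationMode.BICUBIC),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.5, 0.5, 0.5], std=[0.5, 0.5, 0.5]),
])

ckpt_dir = "path/to/hf-compatible-ofa-caption"  # placeholder: local dir or Hub repo id
tokenizer = OFATokenizer.from_pretrained(ckpt_dir)
model = OFAModel.from_pretrained(ckpt_dir, use_cache=False)

prompt = " what does the image describe?"
input_ids = tokenizer([prompt], return_tensors="pt").input_ids
patch_img = patch_resize_transform(Image.open("example.jpg")).unsqueeze(0)

# "HF native generator": the regular generate() API instead of Fairseq's decoder.
gen = model.generate(input_ids, patch_images=patch_img, num_beams=5, no_repeat_ngram_size=3)
print(tokenizer.batch_decode(gen, skip_special_tokens=True))
```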

Hi @JustinLin610, congrats on the awesome work.

> and we'll soon release the one for HF transformers

Do you have any update on this?

@NohTow:

> No, the model...

OK, I quickly had a look at it, and for the base-sized checkpoint ~100 parameters out of 965 have different names. I'll try to make a conversion tomorrow; it's definitely...
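
A rough sketch of the kind of check behind those numbers, assuming both checkpoints are plain PyTorch files (the paths are placeholders):

```python
import torch

# Compare parameter names in the original Fairseq checkpoint with those of an
# HF-compatible state dict to see which ones need renaming.
fairseq_state = torch.load("caption_base_best.pt", map_location="cpu")["model"]
hf_state = torch.load("hf_compatible_base/pytorch_model.bin", map_location="cpu")

only_fairseq = sorted(set(fairseq_state) - set(hf_state))
only_hf = sorted(set(hf_state) - set(fairseq_state))
print(f"{len(only_fairseq)} Fairseq-only names, {len(only_hf)} HF-only names")
```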

I managed to convert the weights from the Fairseq version to a Transformers-compatible one. Here's a [PR on HF Hub](https://huggingface.co/OFA-Sys/OFA-base-caption/discussions/1) for the base version; I'm also making another PR for the large size...

There's no HF model repo for OFA-large-caption, so I couldn't make a PR. Instead, I uploaded the converted model to [my storage for public download](https://storage.googleapis.com/mys-released-models/OFA-large-caption.zip).

> I’ll test it on monday and try to reproduce the paper results.

Cool. I'd like to hear about the results of your tests as well. I'll also share the...

And here comes the huge-sized image-captioning model. [Download, zipped, 2.39 GB](https://storage.googleapis.com/mys-released-models/OFA-huge-caption.zip).
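
If it helps, fetching and unpacking one of these zips only needs the standard library; the directory layout inside the archive may differ, so treat the paths below as placeholders.

```python
import urllib.request
import zipfile

# Download and unpack the converted huge captioning checkpoint linked above
# (the same steps work for the large one).
url = "https://storage.googleapis.com/mys-released-models/OFA-huge-caption.zip"
zip_path = "OFA-huge-caption.zip"
urllib.request.urlretrieve(url, zip_path)

with zipfile.ZipFile(zip_path) as zf:
    zf.extractall("OFA-huge-caption")  # resulting directory is what you point from_pretrained at
```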

And here's the [Colab notebook with explanations](https://colab.research.google.com/drive/1LLJewY92LXdeug5m_ceMUHdlqrRQwSQJ?usp=sharing) that I used for conversion.
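
For anyone who just wants the gist without opening the notebook: the conversion boils down to renaming state-dict keys and saving the result in the HF layout. The single mapping entry below is hypothetical; the real list of ~100 renamed parameters is in the linked Colab.

```python
import torch

# Load the Fairseq checkpoint, rename the mismatched keys, and save a state
# dict the HF-compatible model can load.
fairseq_state = torch.load("caption_large_best.pt", map_location="cpu")["model"]

RENAME = {
    # Hypothetical pair, only to show the shape of the mapping.
    "encoder.image_proj.weight": "encoder.embed_images.proj.weight",
}

hf_state = {RENAME.get(name, name): tensor for name, tensor in fairseq_state.items()}
torch.save(hf_state, "pytorch_model.bin")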

> generate gibberish before the eos token

Yes, I also noticed it. I haven't checked whether the authors did something particular for it, but I postprocessed my caption result as...
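
Something along these lines (a simple heuristic, not the exact snippet from the truncated comment above):

```python
import re

def clean_caption(caption: str) -> str:
    """Drop the trailing gibberish some checkpoints emit before EOS.

    Heuristic: keep everything up to the first sentence terminator.
    """
    match = re.match(r"[^.!?]*[.!?]", caption.strip())
    return match.group(0).strip() if match else caption.strip()

print(clean_caption("a man riding a wave on a surfboard . &# %%"))
# -> "a man riding a wave on a surfboard ."
```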

> I guess it is hurting the performance in the end

Don't think so. As previously stated, most of the Seq2Seq models have this behavior. I also observed it in...