TCL icon indicating copy to clipboard operation
TCL copied to clipboard

Inference + Hugging Face

Open mattmdjaga opened this issue 1 year ago • 3 comments

Is there an easy way to inference the model on some new examples? Also, are there any plans to put the model on Hugging face?

mattmdjaga avatar Aug 11 '22 11:08 mattmdjaga

Hi, thanks for your interest in our work. What kind of inference do you mean? Image-text retrieval tasks? In terms of putting the model on HuggingFace, I need to check with my team and let you know later. Thanks.

viyjy avatar Aug 11 '22 17:08 viyjy

Inference for the generation task Visual question answering (VQA).

mattmdjaga avatar Aug 12 '22 10:08 mattmdjaga

I see. What't the difficulties of applying our current inference code on general VQA? VQA.py cannot be used in your tasks?

viyjy avatar Aug 12 '22 18:08 viyjy

I tried to extract an inference code from the 'VQA.py' file but I found that you need to supply the model, question and answers. Whereas I thought that the model will generate the answers. So is there no way to inference on VQA without having pre-defined answers?

mattmdjaga avatar Aug 14 '22 13:08 mattmdjaga

The model will generate the answers using an answer decoder, isn't it? The only difference is that we constrain the answer decoder to only generate from the 3,192 candidate answers.

viyjy avatar Aug 15 '22 04:08 viyjy