SKBL5694 comments

Results 26 comments of


                                            SKBL5694

[Bug]: RuntimeError: FIND was unable to find an engine to execute this computation

I get the same bug, and I change pytorch 2.0.0 to torch1.13.1+cu117(I guess 1.x.x all works well), it works well.

Some doubts about weights

> Hi, for the VQA task, the text encoder and decoder do not share parameters (their parameters are shared during pre-training). Thanks for your reply. I think you mean the...

Is the LM better than MLM?

> Hi, the primary reason for using LM is because we want to enable image-to-text generation capability. Both losses perform similarly in terms of VL representation learning (MLM can be...

Tag to class names for downstream applications?

I think the key lies in how to define **open-vocabulary**. The following is my personal understanding of ram after reading code and papers: In ram, we define a large enough...

Some questions about grad-CAM showing in fig7 in paper Tag2Text.

It seems that I am indeed doing gradcam on the recognition task, because your code does not open the generation task for RAM, but I have added the generation task...

Some questions about grad-CAM showing in fig7 in paper Tag2Text.

I think I have some problem with image-tag interaction encoder performing backward calculation grad. The calculation method I use is to pre-define a hook, and then register it to the...