OneLLM: Are the embeddings generated from different modalities comparable?

Open mengfanShi opened this issue 1 year ago • 2 comments

Just like CLIP, are the embeddings generated by the Universal Encoder comparable across modalities? If so, we could perform search and matching based on embedding similarity between data from different modalities. Could you provide the encoder part of the model separately for testing? The full 15GB model is too large at the moment.
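
A minimal sketch of the similarity-based matching described above, assuming embeddings from different modalities share one vector space and the same dimension. The vectors are made-up placeholders, not OneLLM outputs, and `cosine_similarity` and `rank_by_similarity` are hypothetical helper names, not part of the OneLLM code.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def rank_by_similarity(query, candidates):
    """Return (name, score) pairs sorted from most to least similar."""
    scored = [(name, cosine_similarity(query, vec)) for name, vec in candidates.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Placeholder embeddings (assumption): one image query, two audio candidates.
image_embedding = [0.9, 0.1, 0.0]
audio_embeddings = {
    "dog_bark.wav": [0.8, 0.2, 0.1],
    "piano.wav": [0.0, 0.1, 0.9],
}

ranking = rank_by_similarity(image_embedding, audio_embeddings)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Such a ranking is only meaningful if the encoder was trained to align the modalities, as CLIP is for images and text.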

Jul 11 '24 03:07 mengfanShi

Well, since we didn't train the model on exactly paired data, the comparability may not meet your expectations at this time.

Thanks for your attention.

Jul 11 '24 11:07 kxgong

But I see you ran the test on Music-AVQA in the paper. Could you tell me how you managed to use three modalities to generate answers? Thank you very much!

Jul 28 '24 06:07 Cece1031
