OneLLM icon indicating copy to clipboard operation
OneLLM copied to clipboard

Whether the embedings generated by different modal data has comparability?

Open mengfanShi opened this issue 1 year ago • 2 comments

just like CLIP, whether embedings generated by Universal Encoder has comparability? if can, we can perform search and matching based on the similarity of embedings for different modal data. Could you provide the Encoder part of the model separately for testing? The overall 15GB model is too large at the moment.

mengfanShi avatar Jul 11 '24 03:07 mengfanShi

Well, since we didn't train the model on exact pair data, the comparability might not satisfy your expectation at this time.

Thanks for your attention.

kxgong avatar Jul 11 '24 11:07 kxgong

Well, since we didn't train the model on exact pair data, the comparability might not satisfy your expectation at this time.

Thanks for your attention.

But I see you run the test on Music-AVQA in thesis, could u tell me how you manage to use three modalities to generate answers?Thank u very much!

Cece1031 avatar Jul 28 '24 06:07 Cece1031