ImageBind
How can I extend this model to a multilingual version?
Excuse me, I'm new to this field. I want to extend this model to a multilingual version, and there is one problem:
- For audio, how can I realize a multilingual version? Should I just extend the text encoder to a multilingual version? Thank you.
I think this idea is cool. Is there any research about this?