Ant-Multi-Modal-Framework
Ant-Multi-Modal-Framework copied to clipboard
Any plan to use M2-Encoder to make better text-video retrieval?
Congratulations on training such a great model!Is there any plan to use M2-Encoder to make better text-video retrieval?