multimodal-retrieval topic
cross-modal_entity_consistency
This repository contains the dataset and source files to reproduce the results in the publication Müller-Budack et al. 2021: "Multimodal news analytics using measures of cross-modal entity and context...
artemis
Official code release for ARTEMIS: Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity (published at ICLR 2022)
Multimodal-Image-Retrieval
Explores early fusion and late fusion approaches for Multimodal medical Image Retrieval
GENIUS-CVPR25
Official Implementation of GENIUS: A Generative Framework for Universal Multimodal Search, CVPR 2025
VARAG
Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine
MRAGSurvey
A Survey of Multimodal Retrieval-Augmented Generation
VISTA_Evaluation_FineTuning
Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original code and model can be accessed at FlagEmbedding.