image-text-retrieval topic
Awesome_Matching_Pretraining_Transfering
A paper list covering large multi-modality models, parameter-efficient finetuning, vision-language pretraining, and conventional image-text matching, for preliminary insight.
SGRAF
[AAAI 2021] Code for “Similarity Reasoning and Filtration for Image-Text Matching”
tidy
Offline semantic text-to-image and image-to-image search on Android, powered by a quantized state-of-the-art vision-language pretrained CLIP model and the ONNX Runtime inference engine
mPLUG
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)
RCAR
[TIP 2023] Code for “Plug-and-Play Regulators for Image-Text Matching”
Chinese-CLIP-opencv-onnxrun
Deploys Chinese CLIP with OpenCV + onnxruntime for text-to-image search: describe the desired picture in one sentence and matching images are retrieved from the gallery. Includes both C++ and Python versions of the program.
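The retrieval workflow behind the two CLIP-based search entries above (encode the query text, encode each gallery image, rank by cosine similarity) can be sketched in Python as below. This is a hedged illustration, not any repository's actual code: the ONNX file names, the "image"/"text" input names, the 224x224 preprocessing, and the dummy token ids are assumptions; a real deployment would use the exported model's own tokenizer and preprocessing.

```python
# Minimal sketch of text-to-image search with an ONNX-exported CLIP model and
# onnxruntime. Model file names, input names, and preprocessing are assumptions.
import glob

import cv2
import numpy as np
import onnxruntime as ort

image_sess = ort.InferenceSession("clip_image_encoder.onnx")  # assumed file name
text_sess = ort.InferenceSession("clip_text_encoder.onnx")    # assumed file name


def embed_image(path: str) -> np.ndarray:
    # OpenCV loads BGR; convert to RGB, resize to the model's input size,
    # scale to [0, 1], and reorder to NCHW. The exact preprocessing (size,
    # normalization constants) depends on how the model was exported.
    img = cv2.cvtColor(cv2.imread(path), cv2.COLOR_BGR2RGB)
    img = cv2.resize(img, (224, 224)).astype(np.float32) / 255.0
    img = np.transpose(img, (2, 0, 1))[None]                  # (1, 3, 224, 224)
    feat = image_sess.run(None, {"image": img})[0]            # assumed input name
    return feat / np.linalg.norm(feat, axis=-1, keepdims=True)


def embed_text(token_ids: np.ndarray) -> np.ndarray:
    # token_ids: (1, seq_len) int64 ids produced by the CLIP model's tokenizer.
    feat = text_sess.run(None, {"text": token_ids})[0]        # assumed input name
    return feat / np.linalg.norm(feat, axis=-1, keepdims=True)


# Index the gallery once, then rank images by cosine similarity to the query.
paths = sorted(glob.glob("gallery/*.jpg"))
gallery = np.concatenate([embed_image(p) for p in paths])     # (N, dim)

# Real ids must come from the model's own tokenizer; a dummy zero sequence is
# used here only to keep the sketch self-contained.
token_ids = np.zeros((1, 52), dtype=np.int64)
query = embed_text(token_ids)                                 # (1, dim)
scores = (gallery @ query.T).squeeze(-1)                      # cosine similarities
print([paths[i] for i in np.argsort(-scores)[:5]])            # top-5 matches
```

Because both embeddings are L2-normalized, the dot product equals cosine similarity, so gallery features can be precomputed and cached while only the text query is encoded at search time.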
BagFormer
PyTorch code for BagFormer: Better Cross-Modal Retrieval via bag-wise interaction
Ant-Multi-Modal-Framework
Research code from the Multimodal-Cognition team at Ant Group
ComCLIP
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"