cross-modality topic
Image-Text-Embedding
TOMM2020 Dual-Path Convolutional Image-Text Embedding :feet: https://arxiv.org/abs/1711.05535
CM-NAS
CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification (ICCV2021)
CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
LLCM
[CVPR 2023] Diverse Embedding Expansion Network and Low-Light Cross-Modality Benchmark for Visible-Infrared Person Re-identification
OCN-HOI-Benchmark
[AAAI 2022] Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics.
awesome-multimodal-brain-image-systhesis
awesome-conditional-content-generation
Update-to-data resources for conditional content generation, including human motion generation, image or video generation and editing.
ptp
[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》
MMN
Pytorch code for Towards a Unified Middle Modality Learning for Visible-Infrared Person Re-Identification
Time-LLM
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"