multimodality topic
cornac
A Comparative Framework for Multimodal Recommender Systems
CM3Leon
An open-source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", a multimodal AI that uses only a decoder to generate both text and images
Med-PaLM
Towards Generalist Biomedical AI
mirasol-pytorch
Implementation of 🌻 Mirasol, a SOTA multimodal autoregressive model out of Google DeepMind, in PyTorch
deploy-stable-diffusion-model-on-amazon-sagemaker-endpoint
Deploy Stable Diffusion Model on Amazon SageMaker Endpoint
gluonmm
A library of transformer models for computer vision and multi-modality research
harmful-memes-detection-resources
Resources (conference/journal publications, references to datasets) for harmful memes detection.
LIMoE-pytorch
PyTorch implementation of LIMoE
Matcha-agent
Official implementation of Matcha-agent, https://arxiv.org/abs/2303.08268
DocumentCLIP
[ICPRAI 2024] DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents