multimodality topic

List of multimodality repositories

Generative-AI

778 stars, 59 forks

[TPAMI 2023] Multimodal Image Synthesis and Editing: The Generative AI Era

PALI3

139 stars, 2 forks

Implementation of PALI3 from the paper "PALI-3 VISION LANGUAGE MODELS: SMALLER, FASTER, STRONGER"

prml

16 stars, 11 forks

Multimodal Fully Convolutional Neural Networks for Semantic Segmentation.

swarms-pytorch

110 stars, 10 forks

Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊

PALI

85 stars, 8 forks

Democratization of "PaLI: A Jointly-Scaled Multilingual Language-Image Model"

This guidance creates a scalable environment in AWS to prepare genomic, clinical, mutation, expression and imaging data for large-scale analysis and perform interactive queries against a data lake. Th...

NaViT

172 stars, 9 forks

My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"

maidr

16 stars, 4 forks

Multimodal Access and Interactive Data Representation

RAG-Survey

1.2k stars, 83 forks

A collection of awesome papers on RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in the paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

Woodpecker

599 stars, 29 forks

✨✨ Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.