blip2 topics

Automate Fashion Image Captioning using BLIP-2. Automatically generating descriptions of clothes on shopping websites can help customers without fashion knowledge better understand the features...

SmithaUpadhyaya

blip2

huggingface-datasets

huggingface-transformers

image

PaddleMIX

708

Stars

223

Forks

708

Watchers

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrained models and a diffusion model toolbox. Equipped with high per...

PaddlePaddle

aigc

blip2

clip

coca

chat-with-nerf

302

Stars

19

Forks

Watchers

Chat with NeRF enables users to interact with a NeRF model by typing in natural language.

sled-group

blip2

chatgpt

gpt-4

lerf

ComCLIP

36

Stars

5

Forks

36

Watchers

Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"

eric-ai-lab

blip2

causality

clip

compositionality

qformer

29

Stars

0

Forks

Watchers

Implementation of Qformer from BLIP2 in Zeta Lego blocks.

kyegomez

ai

artificial-intelligence

attention-mechanism

blip2

MiniGPT-4-discord-bot

44

Stars

2

Forks

Watchers

A true multimodal LLaMA derivative -- on Discord!

152334H

ai

blip2

discord-bot

llama

Vision-Language-Models-Overview

444

Stars

22

Forks

444

Watchers

A collection and survey of the latest vision-language model papers and model GitHub repositories. Continuously updated.

zli12321

blip2

claude

clip

deepseek

ai-powered-video-analyzer

42

Stars

15

Forks

42

Watchers

An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Oll...

arashsajjadi

ai-video-analysis

audio-event-detection

blip2

gui