llava topic

List llava repositories

uform

913
Stars
53
Forks
Watchers

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Sight-Beyond-Text

19
Stars
1
Forks
Watchers

This repository includes the official implementation of our paper "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"

MMC

53
Stars
3
Forks
Watchers

[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning

taggui

349
Stars
15
Forks
Watchers

Tag manager and captioner for image datasets

FindTheChatGPTer

2.0k
Stars
201
Forks
Watchers

ChatGPT爆火,开启了通往AGI的关键一步,本项目旨在汇总那些ChatGPT的开源平替们,包括文本大模型、多模态大模型等,为大家提供一些便利

llava-cpp-server

166
Stars
9
Forks
Watchers

LLaVA server (llama.cpp).

multi_token

150
Stars
6
Forks
Watchers

Embed arbitrary modalities (images, audio, documents, etc) into large language models.

SUPIR

3.7k
Stars
315
Forks
65
Watchers

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild

gpt-4v-distribution-shift

28
Stars
2
Forks
Watchers

Code for "How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation"

restai

329
Stars
63
Forks
Watchers

RestAI is an AIaaS (AI as a Service) open-source platform. Built on top of LlamaIndex, Ollama and HF Pipelines. Supports any public LLM supported by LlamaIndex and any local LLM suported by Ollama. Pr...