llava topics

uform

913

Stars

53

Forks

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

unum-cloud

bert

clip

clustering

contrastive-learning

Sight-Beyond-Text

19

Stars

1

Forks

This repository includes the official implementation of our paper "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"

llava

MMC

53

Stars

3

Forks

[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning

FuxiaoLiu

arxiv

benchmark

chart

dataset

taggui

349

Stars

15

Forks

Tag manager and captioner for image datasets

llava

FindTheChatGPTer

2.0k

Stars

201

Forks

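ChatGPT's explosive popularity marked a key step toward AGI. This project aims to collect open-source alternatives to ChatGPT, including large text models and large multimodal models, as a convenience for everyone.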

agi

llava-cpp-server

166

Stars

9

Forks

LLaVA server (llama.cpp).

trzy

llama

llama2

llava

llm

multi_token

150

Stars

6

Forks

Embed arbitrary modalities (images, audio, documents, etc) into large language models.

sshh12

large-context

large-language-models

large-multimodal-models

llava

SUPIR

3.7k

Stars

315

Forks

65

Watchers

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild

llava

gpt-4v-distribution-shift

28

Stars

2

Forks

Code for "How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation"

jameszhou-gl

ai

clip

distribution-shift

generalization

RestAI is an open-source AIaaS (AI as a Service) platform, built on top of LlamaIndex, Ollama and HF Pipelines. Supports any public LLM supported by LlamaIndex and any local LLM supported by Ollama. Pr...

llama