foundation-models topics

torchxrayvision

842

Stars

207

Forks

Watchers

TorchXRayVision: A library of chest X-ray datasets and models. Classifiers, segmentation, and autoencoders.

mlmed

chest-radiographs

chest-xray

chest-xray-images

chestxray14

LLaVA

19.5k

Stars

2.1k

Forks

135

Watchers

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

haotian-liu

chatbot

chatgpt

gpt-4

llama

awesome-llm-powered-agent

1.4k

Stars

110

Forks

Watchers

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

hyp1231

awesome-list

chatgpt

embodied-agent

embodied-ai

autodistill

1.9k

Stars

149

Forks

Watchers

Images to inference with no labeling (use foundation models to train supervised models).

autodistill

auto-labeling

computer-vision

deep-learning

foundation-models

awesome-segment-anything-extensions

340

Stars

13

Forks

Watchers

Segment-anything related awesome extensions/projects/repos.

JerryX1110

application

awesome

awesome-list

caption-anything

LRV-Instruction

249

Stars

13

Forks

Watchers

[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

FuxiaoLiu

chatgpt

evaluation

evaluation-metrics

foundation-models

InternVideo

1.3k

Stars

85

Forks

Watchers

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

OpenGVLab

action-recognition

benchmark

contrastive-learning

foundation-models

Emu

1.6k

Stars

85

Forks

20

Watchers

Emu Series: Generative Multimodal Models from BAAI

baaivision

foundation-models

generative-pretraining-in-multimodality

in-context-learning

instruct-tuning

PointLLM

538

Stars

24

Forks

Watchers

[ECCV 2024 Oral] PointLLM: Empowering Large Language Models to Understand Point Clouds

OpenRobotLab

3d

chatbot

foundation-models

gpt-4

ViP-LLaVA

292

Stars

22

Forks

Watchers

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

WisconsinAIVision

chatbot

clip

foundation-models

gpt-4