foundation-models topic
torchxrayvision
TorchXRayVision: A library of chest X-ray datasets and models. Classifiers, segmentation, and autoencoders.
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
awesome-llm-powered-agent
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
autodistill
Images to inference with no labeling (use foundation models to train supervised models).
awesome-segment-anything-extensions
Segment-anything related awesome extensions/projects/repos.
LRV-Instruction
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Emu
Emu Series: Generative Multimodal Models from BAAI
PointLLM
[ECCV 2024 Oral] PointLLM: Empowering Large Language Models to Understand Point Clouds
ViP-LLaVA
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts