gpt-4-vision topic

List gpt-4-vision repositories

LibreChat

32.9k
Stars
6.6k
Forks
32.9k
Watchers

Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message se...

lobe-chat

69.7k
Stars
14.3k
Forks
69.7k
Watchers

🤯 LobeHub - an open-source, modern design AI Agent Workspace. Supports multiple AI providers, Knowledge Base (file upload / RAG ), one click install MCP Marketplace and Artifacts / Thinking. One-clic...

gptty

48
Stars
7
Forks
Watchers

ChatGPT wrapper in your TTY

sgpt

405
Stars
33
Forks
405
Watchers

SGPT is a command-line tool that provides a convenient way to interact with OpenAI models, enabling users to run queries, generate shell commands and produce code directly from the terminal.

openai-vision-api-for-videos

61
Stars
9
Forks
Watchers

Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦

maestro

2.7k
Stars
220
Forks
2.7k
Watchers

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

sports

475
Stars
32
Forks
Watchers

Cool experiments at the intersection of Computer Vision and Sports ⚽🏃

ViP-LLaVA

292
Stars
22
Forks
Watchers

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

py-gpt

536
Stars
105
Forks
Watchers

Desktop AI Assistant powered by GPT-4, GPT-4 Vision, GPT-3.5, Gemini, Claude, Llama 3, DALL-E, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, agents, code/command...

SirChatalot

72
Stars
14
Forks
72
Watchers

SirChatalot is a Telegram bot leveraging ChatGPT, Claude or YandexGPT. It uses Whisper for speech-to-text and DALL-E, Stability AI or YandexART for image creation. It can use vision capabilities, tool...