gpt4v topic

List gpt4v repositories

AppAgent

4.4k
Stars
471
Forks
56
Watchers

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

tarsier

1.1k
Stars
54
Forks
Watchers

Vision utilities for web interaction agents 👀

sketch2app

62
Stars
37
Forks
Watchers

The ultimate sketch to code app made using GPT4 vision. Choose your desired framework (React, Next, React Native, Flutter) for your app. It will instantly generate code and preview (sandbox) from a si...

Awesome-Multimodal-Prompts

190
Stars
15
Forks
Watchers

Prompts of GPT-4V & DALL-E3 to full utilize the multi-modal ability. GPT4V Prompts, DALL-E3 Prompts.

amazing-openai-api

126
Stars
10
Forks
Watchers

Convert different model APIs into the OpenAI API format out of the box.

InternLM-XComposer

1.8k
Stars
118
Forks
Watchers

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.

MobileAgent

1.9k
Stars
159
Forks
32
Watchers

Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception

GPT4-Vision-React-Starter

62
Stars
41
Forks
Watchers

Early Alpha Release: Chat with Your Image - Leveraging GPT-4 Vision and Function Calls for AI-Powered Image Analysis and Description

WebcamGPT-Vision

253
Stars
41
Forks
Watchers

Lightweight GPT-4 Vision processing over the Webcam