gpt4v topic

List gpt4v repositories

AppAgent

4.9k
Stars
524
Forks
56
Watchers

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

tarsier

1.4k
Stars
82
Forks
Watchers

Vision utilities for web interaction agents 👀

sketch2app

71
Stars
37
Forks
Watchers

The ultimate sketch to code app made using GPT4o serving 25k+ users. Choose your desired framework (React, Next, React Native, Flutter) for your app. It will instantly generate code and preview (sandb...

Awesome-Multimodal-Prompts

216
Stars
17
Forks
Watchers

Prompts of GPT-4V & DALL-E3 to full utilize the multi-modal ability. GPT4V Prompts, DALL-E3 Prompts.

amazing-openai-api

142
Stars
11
Forks
Watchers

Convert different model APIs into the OpenAI API format out of the box.

InternLM-XComposer

2.5k
Stars
152
Forks
Watchers

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

MobileAgent

2.9k
Stars
265
Forks
32
Watchers

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

GPT4-Vision-React-Starter

71
Stars
41
Forks
Watchers

Early Alpha Release: Chat with Your Image - Leveraging GPT-4 Vision and Function Calls for AI-Powered Image Analysis and Description

WebcamGPT-Vision

270
Stars
47
Forks
Watchers

Lightweight GPT-4 Vision processing over the Webcam