gpt4v topic

List gpt4v repositories

AppAgent

3.6k
Stars
343
Forks
44
Watchers

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

tarsier

375
Stars
23
Forks
Watchers

Vision utilities for web interaction agents 👀

sketch2app

15
Stars
4
Forks
Watchers

The ultimate sketch to code app made using GPT4 vision. Choose your desired framework (React, Next, React Native, Flutter) for your app. It will instantly generate code and preview (sandbox) from a si...

Awesome-Multimodal-Prompts

157
Stars
9
Forks
Watchers

Prompts of GPT-4V & DALL-E3 to full utilize the multi-modal ability. GPT4V Prompts, DALL-E3 Prompts.

amazing-openai-api

71
Stars
5
Forks
Watchers

Convert different model APIs into the OpenAI API format out of the box.

InternLM-XComposer

1.1k
Stars
74
Forks
Watchers

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.

MobileAgent

1.2k
Stars
112
Forks
32
Watchers

Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception

GPT4-Vision-React-Starter

49
Stars
30
Forks
Watchers

Early Alpha Release: Chat with Your Image - Leveraging GPT-4 Vision and Function Calls for AI-Powered Image Analysis and Description

WebcamGPT-Vision

243
Stars
40
Forks
Watchers

Lightweight GPT-4 Vision processing over the Webcam