evals icon indicating copy to clipboard operation
evals copied to clipboard

GPT 4 Image Evals

Open Rafcin opened this issue 1 year ago • 4 comments

Hello, I wanted to open an issue/question regarding GPT-4 evals with images. I have a few questions as I'm looking to write some evals to test a few ideas I had.

  1. Is there a rough timeline for when GPT-4 with image scanning will be available?
  2. When this feature comes out, will queries have to be done individually? Can we create a batch of images and include a prompt?
  3. If someone were to process large quantities of large images, say 10k images a day, what would that look like cost wise?

I plan to write an eval to test a significantly large dataset of high-quality 360 images of cars and answer questions the end user of a site like Carvana may have. Questions like:

  • Does this car have any visible markings, dents, or damage on the surface?
  • Will this car be able to take me off the road when I go camping?
  • Is this a good fuel-efficient car for me to use?
  • ...

Would an evaluation like this be helpful to the public? I want to test this for my needs, but if this would be useful, I can take a fragment of the set and upload it somewhere once I write the prompts.

Rafcin avatar Mar 23 '23 04:03 Rafcin

Great Idea

bilalmohib avatar Mar 24 '23 00:03 bilalmohib

That is a great question regarding the cost of image processing.

Tkinfo11 avatar Mar 24 '23 04:03 Tkinfo11

I'm curious; I've reached out to the OpenAI team but haven't heard back. My guess is they won't have pricing for a while, it seems like getting GPT-4 to scale for more users is the current challenge.

Rafcin avatar Mar 24 '23 06:03 Rafcin

I think one of the issues is the availability of the Nvidia chips needed to power the platform.

Tkinfo11 avatar Mar 24 '23 16:03 Tkinfo11

We are interested in supporting image evals as soon as that feature becomes more widely available and the API is finalized.

andrew-openai avatar Mar 25 '23 16:03 andrew-openai

@andrew-openai that's great! We are super excited for that! We're looking forward to the image feature as well as taking advantage of plugins to create tools that can interact with our vehicle databases. Hopefully this is something we can set up soon to start testing!

Rafcin avatar Mar 26 '23 17:03 Rafcin

Going to close for now, please feel free to reopen once you start building the eval :)

andrew-openai avatar Mar 30 '23 00:03 andrew-openai