open-interpreter icon indicating copy to clipboard operation
open-interpreter copied to clipboard

Add "vision" to Python mode

Open tegnike opened this issue 2 years ago • 2 comments

Describe the changes you have made:

Now we can use GPT-4Vision for only Terminal mode. I added it to Python mode.

Reference any relevant issue (Fixes #795)

  • [x] I have performed a self-review of my code:

I have tested the code on the following OS:

  • [ ] Windows
  • [x] MacOS
  • [ ] Linux

AI Language Model (if applicable)

  • [x] GPT4
  • [ ] ~~GPT3~~
  • [ ] ~~Llama 7B~~
  • [ ] ~~Llama 13B~~
  • [ ] ~~Llama 34B~~
  • [ ] ~~Huggingface model (Please specify which one)~~

tegnike avatar Dec 16 '23 22:12 tegnike

@Notnaton

Hi, I've attempted to address issue #795.

Two weeks ago, when I first examined this issue, it appeared to be a more challenging case.
From what I recall, this solution I committed did not resolve the issue at that time.

However, upon revisiting it now, it seems to be working correctly.
Could you please review my latest commit to ensure that there are no outstanding problems?

tegnike avatar Dec 16 '23 23:12 tegnike

I updated it to the current main branch Sorry, I missed this PR working on a review now

Notnaton avatar Jan 22 '24 19:01 Notnaton

Hey @tegnike! Great work on this one, would have around vision into the Python package much more quickly. Anton had a good profiles PR cooking so I knew we were heading that way instead, but I should have commented about that on this PR much sooner.

Closing in favor of the new profiles, but this is making me realize we need better instructions to set up vision in Python, so I'll update the docs in a moment.

KillianLucas avatar Mar 28 '24 22:03 KillianLucas