self-operating-computer
self-operating-computer copied to clipboard
A framework to enable multimodal models to operate a computer.
### Is your feature request related to a problem? Please describe. Does this work with Azure Open AI today? We can add that support. ### Describe the solution you'd like...
Found a bug? Please fill out the sections below. 👍 ### Describe the bug When i launch "operate" and give my OpenAI key, this is the result : Hello, I...
Refs: #171 ## What does this PR do? Adds ability to use remote ollama server Fixes # (issue) ## Requirement/Documentation - If there is a requirement document, please, share it...
If there is some learning process before the actual task it would be working accurately rather than navigating to unnecessary places or clicking on to wrong options. Like AppAgent which...
Consider adding a human intervention? Like AutoGPT, there are times when the program repeats an impossible task, and we can manually intervene.
I am using linux with ollama. After ollama pull llava and ollama serve, operate -m llava return error local variable 'content' referenced before assignment ### Steps to Reproduce 1. Alternatively...
What i mean by GUI interface is to have some sort of interface rather than from terminal so whenever there is a task its handy to run things on the...
At the moment the self operating computer has a bit of an idea on the given task but it is slow and buggy. Trying to get it to select emails...
### Is your feature request related to a problem? Please describe. I'd like to be able to use an ollama instance running on a different machine. ### Describe the solution...
Found a bug? Please fill out the sections below. 👍 ### Describe the bug Ran `operate -m gemini-pro-vision`, entered my gemini API key from google AIstudio, but when I request...