computer-use topic
OpenAdapt
Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
webmarker
Mark web pages for use with vision-language models
awesome-llm-os
A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating Systems (LLM-OS).
Agent-S
Agent S: an open agentic framework that uses computers like a human
intelli-browser
✨ Use natural language to control your browser, powered by LLM and playwright
UI-TARS-desktop
The Open-sourced Multimodal AI Agent Stack connecting Cutting-edge AI Models and Agent Infra.
cua
c/ua is the Docker Container for Computer-Use AI Agents.
bytebot
Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containerized Linux desktop environment.
Aeiva
A general AI agent framework that can be adapted to various tasks and environments.
browser-operator-core
Browser Operator - The AI browser with built in Multi-Agent platform! Open source alternative to ChatGPT Atlas, Perplexity Comet, Dia and Microsoft CoPilot Edge Browser