fuji-web
Fuji is an AI agent that lives in your browser's sidepanel. You can now get tasks done online with a single command!
Fuji-Web: AI-based Full Browser Automation 🗻
Fuji-Web is an intelligent AI partner that understands the user’s intent, navigates websites autonomously, and executes tasks on the user’s behalf while explaining each action step.
Demo
https://github.com/normal-computing/fuji-web/assets/1001890/88a2fa12-31d9-4856-be67-27dcf9f1e634
How does it work?
Please read our blog post for a demo video, benchmarks, and a deep-dive technical overview!
Installing and Running
Download and install the extension in your browser
- Go to the releases page, find the latest version of the extension and download "fuji-extension.zip".
- Unzip the file.
- Load your extension on Chrome by doing the following:
  - Navigate to chrome://extensions/
  - Toggle Developer mode
  - Click on Load unpacked extension
  - Select the unzipped folder
Use the extension
Please note that you may need to refresh the page for the extension to work.
- Find the Fuji icon in the top right corner of your browser and click on it to open the sidepanel.
- Create or access an existing OpenAI API Key and paste it in the provided box. This key will be stored in your browser, and will not be uploaded to a third party.
- Finally, navigate to the webpage you want Fuji-Web to work on and type in the task you want it to perform.
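As an illustration of what keeping the API key in the browser can look like, here is a minimal TypeScript sketch. It is not taken from the Fuji-Web source: the `KeyStore` interface, the `MemoryStore` class, the `STORAGE_KEY` name, and both helper functions are hypothetical. The interface is shaped so that the browser's `window.localStorage` satisfies it, while the in-memory stand-in lets the sketch run outside a browser.

```typescript
// Hypothetical storage interface (an assumption, not Fuji-Web's actual code).
// window.localStorage satisfies it structurally via getItem/setItem.
interface KeyStore {
  getItem(key: string): string | null;
  setItem(key: string, value: string): void;
}

// In-memory stand-in so the sketch can run outside a browser.
class MemoryStore implements KeyStore {
  private data = new Map<string, string>();

  getItem(key: string): string | null {
    return this.data.get(key) ?? null;
  }

  setItem(key: string, value: string): void {
    this.data.set(key, value);
  }
}

// Hypothetical storage key name.
const STORAGE_KEY = "openai-api-key";

// Trim the pasted key, reject empty input, and keep it in local storage only.
function saveApiKey(store: KeyStore, apiKey: string): void {
  const trimmed = apiKey.trim();
  if (trimmed.length === 0) {
    throw new Error("API key must not be empty");
  }
  store.setItem(STORAGE_KEY, trimmed);
}

// Return the stored key, or null if none has been saved yet.
function loadApiKey(store: KeyStore): string | null {
  return store.getItem(STORAGE_KEY);
}
```

In a browser context, passing `window.localStorage` as the `store` argument would keep the key on the local machine; nothing in this sketch sends it anywhere.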
Note: all prompts (text and image) are sent directly to the API of your selection. Fuji-Web does not attempt to collect any information from you.
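To make "sent directly to the API" concrete, the sketch below builds a request that combines a text task with a screenshot, assuming the OpenAI Chat Completions endpoint and its documented message format. It is not Fuji-Web's actual prompt code: the `buildVisionRequest` function, the `PreparedRequest` type, and the choice of model are hypothetical, and the request is only constructed here, not sent.

```typescript
// Hypothetical shape for a prepared HTTP request (not part of Fuji-Web).
interface PreparedRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

// Build a Chat Completions request carrying one text part and one image part.
// The request targets the API host directly; no intermediate server is involved.
function buildVisionRequest(
  apiKey: string,
  model: string, // caller-supplied; which model Fuji-Web uses is not assumed here
  task: string,
  screenshotDataUrl: string // e.g. "data:image/png;base64,..."
): PreparedRequest {
  const payload = {
    model,
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: task },
          { type: "image_url", image_url: { url: screenshotDataUrl } },
        ],
      },
    ],
  };

  return {
    url: "https://api.openai.com/v1/chat/completions",
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${apiKey}`,
    },
    body: JSON.stringify(payload),
  };
}
```

A caller could pass the result to `fetch(request.url, { method: request.method, headers: request.headers, body: request.body })`. Keeping construction separate from sending makes it easy to inspect exactly what leaves the browser.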
Build the extension
If you want to build the extension from source, follow these instructions:
- Ensure you have Node.js installed. Development was done on Node v20, but some earlier versions should also work.
- Clone this repository
- Install pnpm globally: npm install -g pnpm
- Run pnpm install
- Run pnpm dev to start the development server, or pnpm build to build the extension.
When loading the extension, you will need to load the dist folder created by the build process.
Roadmap
- Expose API for easy integration with browser automation frameworks (e.g. Puppeteer, Playwright, Selenium)
- Add support for more complex & cross-tab workflows
- Add support for more browsing behaviors (select from dropdown, extract content from entire page etc.)
- Add support for saving workflows
- Add support for sharing workflows & instructions with others
- Create a Wikipedia-like knowledge base where users can collaborate on knowledge that improves Fuji-Web's performance
Troubleshooting
Check out our Troubleshooting Guide for help with common problems.
Contributing
Interested in contributing to Fuji-Web? We'd love your help! Check out our Contribution Guide for guidelines on how to contribute, report bugs, suggest enhancements, and more.
We have also set up a dedicated channel for Fuji-Web feedback on Discord at https://discord.gg/yfMjZ8udb5.
Credits
- Fuji-Web's image annotation method was inspired by Microsoft's UFO paper.
- Fuji as a tool that lives in the browser sidepanel was inspired by TaxyAI's browser extension. We also used some of its UI code.
- The Chrome extension set-up leveraged an awesome boilerplate project, Jonghakseo/chrome-extension-boilerplate-react-vite.
- The Fuji logo is from the Toss Face Emoji design set.