Fuji-Web: AI based Full Browser Automation 🗻

Fuji-Web is an intelligent AI partner that understands the user’s intent, navigates websites autonomously, and executes tasks on the user’s behalf while explaining each action step.

Demo

https://github.com/normal-computing/fuji-web/assets/1001890/88a2fa12-31d9-4856-be67-27dcf9f1e634

How does it work?

Please read our blog post for a demo video, benchmarks and deep-dive technical overview!

Installing and Running

Download and Install the extension in your browser

Go to the releases page, find the latest version of the extension and download "fuji-extension.zip".
Unzip the file.
Load your extension on Chrome by doing the following:
1. Navigate to chrome://extensions/
2. Toggle Developer mode
3. Click on Load unpacked extension
4. Select the unzipped folder

Use the extension

Please note that you may need to refresh the page for the extension to work.

Find the Fuji icon in the top right corner of your browser and click on it to open the sidepanel.
Create or access an existing OpenAI API Key and paste it in the provided box. This key will be stored in your browser, and will not be uploaded to a third party.
Finally, navigate to a webpage you want Fuji-Web and type in the task you want it to perform.

Note: all prompts (text and image) are sent directly to the API of your selection. Fuji-Web does not attempt to collect any information from you.

Build the extension

If you want to build the extension from source, follow these instructions:

Ensure you have Node.js. The development was done on Node v20 but it should work with some lower versions.
Clone this repository
Install pnpm globally: npm install -g pnpm
Run pnpm install
Run pnpm dev to start the development server, or pnpm build to build the extension.

When loading the extension, you will need to load the dist folder created by the build process.

Roadmap

Expose API for easy integration with browser automation frameworks (e.g. Puppeteer, Playwright, Selenium)
Add support for more complex & cross-tab workflows
Add support for more browsing behaviors (select from dropdown, extract content from entire page etc.)
Add support for saving workflows
Add support for sharing workflows & instructions with others
Create wikipedia-like knowledge base where users can work together to create knowledge that can improve the Fuji-Web's performance

Troubleshooting

Check out our Troubleshooting Guide for help with common problems.

Contributing

Interested in contributing to Fuji-Web? We'd love your help! Check out our Contribution Guide for guidelines on how to contribute, report bugs, suggest enhancements, and more.

We also have set up a dedicated channel for Fuji-Web feedback on Discord at https://discord.gg/yfMjZ8udb5.

Credits

Fuji-Web's image annotation method was inspired by Microsoft's UFO paper.
Fuji as a tool that lives in the browser sidepanel was inspired by TaxyAI's browser extension. We also used some of its UI code.
The Chrome extension set-up leveraged an awesome boilerplate project Jonghakseo/chrome-extension-boilerplate-react-vite.
The Fuji logo is from Toss Face Emoji design set.

fuji-web
fuji-web copied to clipboard

Metadata

Fuji-Web: AI based Full Browser Automation 🗻

Demo

How does it work?

Installing and Running

Download and Install the extension in your browser

Use the extension

Build the extension

Roadmap

Troubleshooting

Contributing

Credits

← Metadata

Owner

Metadata

fuji-web fuji-web copied to clipboard

Metadata

Fuji-Web: AI based Full Browser Automation 🗻

Demo

How does it work?

Installing and Running

Download and Install the extension in your browser

Use the extension

Build the extension

Roadmap

Troubleshooting

Contributing

Credits

← Metadata

Owner

Metadata

fuji-web
fuji-web copied to clipboard