discord: https://discord.gg/cK9WeQ7jPq
a chrome extension that performs web browsing actions autonomously to complete a given goal/task (using LLM as a brain). )
follow instructions on the latest release: https://github.com/aj47/clickolas-cage/releases
- Check if your
Node.js
version is >= 14. run these commands to run everything locally:
cd clickolas-cage
npm install
npm run dev
Enter an OpenRouter API key in the popup settings.
This runs the chrome extension locally.
To link the running extension to chrome:
- set your Chrome browser 'Developer mode' on
- click 'Load unpacked', and select
clickolas-cage/build
folder
There are two ways to open the Clickolas Cage extension:
- Click on the extension icon in your Chrome toolbar.
- Use the keyboard shortcut:
- Windows/Linux:
Ctrl+Shift+L
- Mac:
Command+Shift+L
- Windows/Linux:
- Once the extension popup opens, you'll see a text area where you can enter your goal or task.
- You can type your goal directly or use the speech-to-text feature:
- Click the "Start Listening" button to activate speech recognition.
- Speak your goal clearly.
- Click "Stop Listening" when you're done.
- Click the "Submit" button or press Enter to start the automation process.
- A side panel will appear on the webpage, showing the steps Clickolas Cage is taking to complete your task.
- You can interact with the side panel to provide additional input or stop the execution if needed.
API Keys OpenRouter API key
To use Clickolas Cage, you'll need to provide OpenRouter API key for the language models. The extension supports multiple providers:
Once Clickolas Cage starts executing your task, a sidepanel will appear on the right side of the webpage. This panel provides the following controls and information:
-
Minimize/Maximize: Click the '▼' or '▲' button in the top-right corner to minimize or maximize the sidepanel.
-
Drag and Move: You can click and drag the top bar of the sidepanel to reposition it on the page.
-
Message List: The main area of the sidepanel displays messages, including:
- Steps being executed
- User inputs
- System messages
- Completion messages
-
Stop Execution: When Clickolas Cage is actively executing a task, a "Stop Execution" button will appear. Click this to halt the current operation.
-
Input Area: When not executing, you'll see an input field where you can:
- Type additional instructions or queries
- Use voice input for hands-free interaction
-
Voice Input:
- Click the "Voice" button or use the keyboard shortcut (Ctrl+Shift+K) to start/stop voice recognition.
- When active, it will display "Stop" and the button will highlight.
- Speak your instruction clearly, then click "Stop" or use the shortcut again to submit.
-
Send Button: Click "Send" or press Enter to submit your typed or voice-transcribed input.
The sidepanel allows you to monitor the progress of your task, provide additional input when needed, and control the execution of Clickolas Cage directly from the webpage.