Getting Started
Record browser interactions and get automation code. This guide walks you through installing the extension, recording your first interaction, and generating reusable code.
Installation
Install the Chrome Extension
Download and install the InverseUI Chrome extension to start capturing UI interactions:
- Download the extension from GitHub
- Unzip the downloaded file
- Open Chrome and go to chrome://extensions
- Enable "Developer mode" in the top right corner
- Click "Load unpacked" and select the unzipped folder
- Sign in with your Google account
Once the extension is installed, the InverseUI icon appears in your browser toolbar. Click it to start recording your first interaction.
First Project
Let's capture your first UI interaction and generate automation code.
1. Start Recording
Record the actions you want to automate:
- Click the InverseUI extension icon
- Select "Start Recording"
- Perform any actions in the browser (clicks, typing, navigation, etc.)
- Click "Stop Recording" when you're done
2. Review Recording
When you stop recording, InverseUI shows everything it captured:
Actions Captured
All clicks, inputs, and navigations are recorded
Ready-to-Use Functions
Generated functions include typed parameters, docstrings, and defaults, ready for LLM agents to call
3. Generate Code
After recording, InverseUI will automatically generate code that you can use in your automation scripts. The generated code includes all the interactions you recorded, with parameters automatically detected for reusable inputs.
Download the generated Playwright code and integrate it into your existing automation framework, or use our Recording SDK to programmatically control the recording process.
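As a rough illustration of what a generated function with a detected parameter might look like, here is a minimal sketch. It is not actual InverseUI output: the function name `search_products`, the URL, the selectors, and the `PageLike` protocol are all hypothetical. The only real API it assumes is the `goto`, `fill`, and `click` methods of Playwright's sync `Page`.

```python
from typing import Protocol


class PageLike(Protocol):
    """The subset of Playwright's sync Page API this sketch relies on."""

    def goto(self, url: str) -> object: ...
    def fill(self, selector: str, value: str) -> None: ...
    def click(self, selector: str) -> None: ...


def search_products(page: PageLike, query: str = "wireless headphones") -> None:
    """Search the store for a product.

    Args:
        page: A Playwright page, or any object with goto/fill/click.
        query: Search text. Treated as a parameter because it was typed
            during the recording; the default is the recorded value.
    """
    page.goto("https://example.com/")   # hypothetical URL
    page.fill("#search", query)         # hypothetical selector
    page.click("button[type=submit]")   # hypothetical selector
```

With Playwright installed, a page created via `sync_playwright()` can be passed in as `page`. The protocol exists only so the sketch can be read and exercised without a browser.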
Basic Concepts
Understanding these core concepts will help you get the most out of InverseUI.
How It Works
InverseUI captures your interactions and generates automation code:
- Records exact UI interactions
- Identifies variable inputs (parameters)
- Generates clean Playwright/Selenium code
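The three steps above can be sketched as a toy pipeline. This is only an assumption about the general idea, not InverseUI's implementation: the `Action` record, the `value_N` parameter naming, and `generate_function` are hypothetical. Every typed value becomes a parameter whose default is the recorded text, and the emitted source calls Playwright's `page.goto`, `page.fill`, and `page.click`.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Action:
    """One recorded interaction (hypothetical format)."""
    kind: str        # "goto", "click", or "fill"
    target: str      # URL for "goto", selector otherwise
    value: str = ""  # typed text, only used by "fill"


def generate_function(func_name: str, actions: List[Action]) -> str:
    """Turn recorded actions into Python source for a reusable function."""
    params = []  # (parameter name, recorded default)
    body = []
    for action in actions:
        if action.kind == "fill":
            # Typed text is a variable input: lift it into a parameter.
            name = f"value_{len(params) + 1}"
            params.append((name, action.value))
            body.append(f"    page.fill({action.target!r}, {name})")
        elif action.kind in ("goto", "click"):
            body.append(f"    page.{action.kind}({action.target!r})")
    signature = ", ".join(
        ["page"] + [f"{name}: str = {default!r}" for name, default in params]
    )
    return f"def {func_name}({signature}):\n" + "\n".join(body or ["    pass"]) + "\n"


recording = [
    Action("goto", "https://example.com/login"),  # hypothetical URL
    Action("fill", "#email", "me@example.com"),   # hypothetical selector
    Action("click", "#submit"),                   # hypothetical selector
]
source = generate_function("log_in", recording)
```

Here `source` holds the text of a `log_in(page, value_1: str = 'me@example.com')` function that replays the three steps. A real generator would also need to choose meaningful parameter names and robust selectors.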
Why Agents Love InverseUI
Traditional agents waste tokens and time parsing UI. InverseUI gives them muscle memory:
- Token Efficient: 100x fewer tokens per action compared to UI parsing
- Lightning Fast: Direct function calls instead of multi-step reasoning
- Reliable: No more brittle chains of LLM reasoning
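To make "direct function calls" concrete, here is a minimal sketch of how an agent framework could expose a recorded function as a tool. Everything named here is an assumption for illustration: `add_to_cart`, its URL and selectors, the schema layout returned by `describe_tool`, and the `call_tool` dispatcher are hypothetical and not part of InverseUI. The sketch uses only the standard library's `inspect` module to read the typed parameters, defaults, and docstring.

```python
import inspect
from typing import Any, Callable, Dict


def add_to_cart(page, product_id: str, quantity: int = 1) -> None:
    """Add a product to the cart."""
    page.goto(f"https://example.com/products/{product_id}")  # hypothetical URL
    page.fill("#quantity", str(quantity))                    # hypothetical selector
    page.click("#add-to-cart")                               # hypothetical selector


def describe_tool(func: Callable[..., Any]) -> Dict[str, Any]:
    """Build a tool description from a function's signature and docstring."""
    params: Dict[str, Any] = {}
    for name, p in inspect.signature(func).parameters.items():
        if name == "page":
            continue  # supplied by the runtime, not by the agent
        if p.annotation is inspect.Parameter.empty:
            type_name = "any"
        else:
            type_name = getattr(p.annotation, "__name__", str(p.annotation))
        required = p.default is inspect.Parameter.empty
        params[name] = {"type": type_name, "required": required}
        if not required:
            params[name]["default"] = p.default
    return {
        "name": func.__name__,
        "description": inspect.getdoc(func) or "",
        "parameters": params,
    }


def call_tool(tools: Dict[str, Callable[..., Any]], page, name: str,
              arguments: Dict[str, Any]) -> Any:
    """Run the tool the agent chose, using the arguments it supplied."""
    return tools[name](page, **arguments)
```

The agent sees only the compact description from `describe_tool` and replies with a tool name and arguments. `call_tool` then replays the recorded steps without the model inspecting the page.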