browser-use
Cloud browser automation via Browser Use API. Use when you need AI-driven web browsing, scraping, form filling, or multi-step web tasks without local browser control. Triggers on "browser use", "cloud browser", "scrape website", "automate web task", or when local browser isn't available/suitable.
Best use case
browser-use is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Teams using browser-use should expect more consistent output, faster repeated execution, and less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in `.claude/skills/browser-use-api/SKILL.md` inside your project
- Restart your AI agent; it will auto-discover the skill
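The manual steps can also be scripted. This is a minimal sketch, assuming SKILL.md has already been downloaded to a local path; `install_skill` is a hypothetical helper name, and the destination is the path given in the steps above.

```bash
# Hypothetical helper: copy a downloaded SKILL.md into a project's skills directory.
# Usage: install_skill <path-to-SKILL.md> <project-root>
install_skill() {
  src=$1
  project=$2
  dest="$project/.claude/skills/browser-use-api"

  # Refuse to continue if the downloaded file is missing.
  if [ ! -f "$src" ]; then
    echo "install_skill: $src not found" >&2
    return 1
  fi

  # Create the skills directory if needed, then copy the file into place.
  mkdir -p "$dest" && cp "$src" "$dest/SKILL.md"
}
```

For example, `install_skill ~/Downloads/SKILL.md .` would install the skill into the current project. The agent still needs to be restarted afterwards.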
How browser-use Compares
| Feature / Agent | browser-use | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Cloud browser automation via Browser Use API. Use when you need AI-driven web browsing, scraping, form filling, or multi-step web tasks without local browser control. Triggers on "browser use", "cloud browser", "scrape website", "automate web task", or when local browser isn't available/suitable.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Browser Use
Cloud-based AI browser automation. Send a task in plain English, get structured results.
## Quick Start
```bash
# Submit task
curl -s -X POST https://api.browser-use.com/api/v2/tasks \
  -H "X-Browser-Use-API-Key: $BROWSER_USE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"task": "Go to example.com and extract the main heading"}'

# Poll for result (replace TASK_ID)
curl -s "https://api.browser-use.com/api/v2/tasks/TASK_ID" \
  -H "X-Browser-Use-API-Key: $BROWSER_USE_API_KEY"
```
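An inline `-d` payload stops being valid JSON as soon as the task text contains a double quote or a backslash. The sketch below shows one way to build the body from an arbitrary string and pull the task id out of the response. `json_escape` and `bu_submit` are hypothetical names, and the `sed`-based parsing assumes a simple response with a string `id` field; a real script would more likely use a JSON tool such as `jq`.

```bash
API_BASE="https://api.browser-use.com/api/v2"

# Escape backslashes and double quotes so text can sit inside a JSON string.
# Sketch only: newlines and other control characters are not handled.
json_escape() {
  printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g'
}

# Submit a task and print the task id from the response.
# Naive extraction: assumes the response contains "id": "<value>".
bu_submit() {
  body=$(printf '{"task": "%s"}' "$(json_escape "$1")")
  curl -s -X POST "$API_BASE/tasks" \
    -H "X-Browser-Use-API-Key: $BROWSER_USE_API_KEY" \
    -H "Content-Type: application/json" \
    -d "$body" |
    sed -n 's/.*"id"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p'
}
```

With these defined, `TASK_ID=$(bu_submit 'Search for "AI news" and list the top 3 results')` captures the id for later polling.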
## Helper Script
Use `scripts/browser-use.sh` for simpler execution:
```bash
# Run task and wait for result
./scripts/browser-use.sh "Go to hacker news and get the top 3 stories"
# Just submit (don't wait)
./scripts/browser-use.sh --no-wait "Search Google for AI news"
```
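The contents of `scripts/browser-use.sh` are not reproduced here. Purely as an illustration, and not the actual script, a wrapper with the interface shown above could separate the `--no-wait` flag from the task text as follows; `parse_args`, `WAIT`, and `TASK` are hypothetical names.

```bash
# Hypothetical argument parsing for a wrapper with the interface shown above.
# Sets WAIT (1 = poll for the result, 0 = submit only) and TASK.
parse_args() {
  WAIT=1
  TASK=""
  for arg in "$@"; do
    case $arg in
      --no-wait) WAIT=0 ;;
      -*) echo "unknown option: $arg" >&2; return 2 ;;
      *) TASK=$arg ;;
    esac
  done

  # A task description is required in either mode.
  if [ -z "$TASK" ]; then
    echo 'usage: browser-use.sh [--no-wait] "task description"' >&2
    return 2
  fi
}
```

Rejecting unknown options early keeps a mistyped flag from being submitted as a task, which would otherwise cost credits.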
## API Reference
### Create Task
```
POST https://api.browser-use.com/api/v2/tasks
```
Body (`llm` is optional; when omitted, a fast default model is used):
```json
{
  "task": "Plain English description of what to do",
  "llm": "gemini-3-flash-preview"
}
```
Response:
```json
{
  "id": "task-uuid",
  "sessionId": "session-uuid"
}
```
### Get Task Status
```
GET https://api.browser-use.com/api/v2/tasks/{taskId}
```
Response fields:
- `status`: `pending` | `started` | `finished` | `failed`
- `output`: Result text when finished
- `steps`: Array of actions taken (with screenshots)
- `cost`: Cost in dollars (e.g., "0.02")
- `isSuccess`: Boolean result
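Because a task moves through `pending` and `started` before reaching `finished` or `failed`, a client has to poll until it sees one of the two terminal values. The following is a minimal sketch of such a loop. `json_field` and `bu_wait` are hypothetical names, the default interval and attempt limit are arbitrary assumptions, and the `sed` extraction is naive (it can pick up a nested field of the same name, for example inside `steps`), so a JSON parser such as `jq` is the safer choice in practice.

```bash
API_BASE="https://api.browser-use.com/api/v2"

# Print the value of a string field from a JSON document on stdin.
# Naive sed extraction; assumes the field has a string value.
json_field() {
  sed -n 's/.*"'"$1"'"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p' | head -n 1
}

# Poll a task until it reaches a terminal status, then print the full response.
# Usage: bu_wait <task-id> [interval-seconds] [max-attempts]
# Returns 0 on "finished", 1 on "failed", 124 if the attempt limit is reached.
bu_wait() {
  task_id=$1
  interval=${2:-2}
  attempts=${3:-150}
  while [ "$attempts" -gt 0 ]; do
    response=$(curl -s "$API_BASE/tasks/$task_id" \
      -H "X-Browser-Use-API-Key: $BROWSER_USE_API_KEY")
    status=$(printf '%s\n' "$response" | json_field status)
    case $status in
      finished) printf '%s\n' "$response"; return 0 ;;
      failed)   printf '%s\n' "$response"; return 1 ;;
    esac
    attempts=$((attempts - 1))
    sleep "$interval"
  done
  echo "bu_wait: gave up waiting for $task_id" >&2
  return 124
}
```

A typical call would be `bu_wait "$TASK_ID" | json_field output`, which prints the result text once the task has finished. Capping the number of attempts keeps a stuck task from blocking the caller indefinitely.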
### Stop Task
```
POST https://api.browser-use.com/api/v2/tasks/{taskId}/stop
```
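A stop request is most useful when wired into a script's cleanup path, so that an interrupted run does not leave a task consuming credits. A minimal sketch, assuming the endpoint needs only the API key header and no request body; `bu_stop` is a hypothetical name and the shape of the response is not documented here.

```bash
API_BASE="https://api.browser-use.com/api/v2"

# Ask the API to stop a running task and print the raw response body.
# --fail makes curl exit non-zero on HTTP errors (e.g. an unknown task id).
bu_stop() {
  curl -s --fail -X POST "$API_BASE/tasks/$1/stop" \
    -H "X-Browser-Use-API-Key: $BROWSER_USE_API_KEY"
}
```

In a longer script, `trap 'bu_stop "$TASK_ID"' INT TERM` would send the stop request when the script is interrupted with Ctrl-C.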
## Pricing
Roughly $0.01-0.05 per task, depending on complexity. To check your balance:
```bash
curl -s https://api.browser-use.com/api/v2/credits \
  -H "X-Browser-Use-API-Key: $BROWSER_USE_API_KEY"
```
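The per-task range makes it easy to budget a batch before running it. The sketch below simply multiplies a task count by the approximate lower and upper bounds quoted above; `estimate_cost` is a hypothetical helper, and the result is only as accurate as those rough figures.

```bash
# Estimate the cost range for a batch of tasks from the approximate per-task range.
# Usage: estimate_cost <task-count> [min-per-task] [max-per-task]
estimate_cost() {
  awk -v n="$1" -v lo="${2:-0.01}" -v hi="${3:-0.05}" \
    'BEGIN { printf "$%.2f-$%.2f\n", n * lo, n * hi }'
}

# estimate_cost 200   # prints $2.00-$10.00
```

Comparing the upper bound against the balance returned by the credits endpoint gives a conservative check before submitting a large batch.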
## When to Use
- Complex multi-step web workflows
- Sites that block simple scraping
- Form filling and submissions
- When you need screenshots of steps
- When local browser control isn't available
## When NOT to Use
- Simple page fetches (use `web_fetch` instead)
- When you have local browser access (use `browser` tool)
- Rapid/high-volume scraping (use Code Use or local scraping)
Related Skills
stealth-browser
Anti-bot browser automation using Camoufox and Nodriver. Bypasses Cloudflare Turnstile, Datadome, and aggressive anti-bot on sites like Airbnb and Yelp. Use when standard Playwright/Selenium gets blocked.
camoufox-stealth-browser
C++ level anti-bot browser automation using Camoufox (patched Firefox) in isolated containers. Bypasses Cloudflare Turnstile, Datadome, Airbnb, Yelp. Superior to Chrome-based solutions (undetected-chromedriver, puppeteer-stealth) which only patch at JS level. Use when standard Playwright/Selenium gets blocked.
next-browser
Use Nextbrowser cloud API to spin up cloud browsers for Openclaw to run autonomous browser tasks. Primary use is creating browser sessions with profiles (persisted logins/cookies) that Openclaw can control to manage social media and other online accounts. Secondary use is running task subagents for fast autonomous browser automation under residential proxy, browser stealth, and CAPTCHA solving capability. Docs at docs.nextbrowser.com.
super-browser
**The ultimate browser automation framework.** Combines the best of 8 top-rated browser skills.
Agent Browser
A fast Rust-based headless browser automation CLI with Node.js fallback that enables AI agents to navigate, click, type, and snapshot pages via structured commands.
cdp-browser
CDP browser control at localhost:9222.
browser-use-local
Use when you need browser automation via the browser-use CLI or Python code in this OpenClaw container/host: open pages, click/type, take screenshots, extract HTML/links, or run an Agent with an OpenAI-compatible LLM (e.g. Moonshot/Kimi) using a custom base_url. Also use for debugging browser-use sessions (state empty, page readiness timeouts), and for extracting login QR codes from demo/login pages via screenshots or HTML data:image.
browserwing
Control browser automation through HTTP API. Supports page navigation, element interaction (click, type, select), data extraction, accessibility snapshot analysis, screenshot, JavaScript execution, and batch operations.
b0tresch-stealth-browser
Anti-detection web browsing that bypasses bot detection, CAPTCHAs, and IP blocks using puppeteer-extra with stealth.
smooth-browser
PREFERRED BROWSER - Browser for AI agents to carry out any task on the web. Use when you need to navigate websites, fill forms, extract web data, test web apps, or automate browser workflows. Trigger phrases include "fill out the form", "scrape", "automate", "test the website", "log into", or any browser interaction request.
browser-cash
Spin up unblocked browser sessions via Browser.cash for web automation. Sessions bypass anti-bot protections (Cloudflare, DataDome, etc.) making them ideal for scraping and automation.
human-browser
The default browser for AI agents.