browser-use-2

Cloud browser automation via the Browser Use API. Use it when you need AI-driven web browsing, scraping, form filling, or multi-step web tasks without local browser control. Triggers on "browser use", "cloud browser", "scrape website", "automate web task", or when a local browser isn't available or suitable.

533 stars

by sundial-org


Best use case

browser-use-2 is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using browser-use-2 should expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/browser-use-2/SKILL.md --create-dirs "https://raw.githubusercontent.com/sundial-org/awesome-openclaw-skills/main/skills/browser-use-2/SKILL.md"

Manual Installation

1. Download SKILL.md from GitHub.
2. Place it in .claude/skills/browser-use-2/SKILL.md inside your project.
3. Restart your AI agent; it will auto-discover the skill.

How browser-use-2 Compares

| Feature / Agent | browser-use-2 | Standard Approach |
| --- | --- | --- |
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |

Frequently Asked Questions

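What does this skill do?

It runs AI-driven browser tasks in the cloud through the Browser Use API: you describe a task in plain English and get structured results back.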

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Browser Use

Cloud-based AI browser automation. Send a task in plain English, get structured results.

## Quick Start

```bash
# Submit task
curl -s -X POST https://api.browser-use.com/api/v2/tasks \
  -H "X-Browser-Use-API-Key: $BROWSER_USE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"task": "Go to example.com and extract the main heading"}'

# Poll for result (replace TASK_ID)
curl -s "https://api.browser-use.com/api/v2/tasks/TASK_ID" \
  -H "X-Browser-Use-API-Key: $BROWSER_USE_API_KEY"
```
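
The Quick Start leaves `TASK_ID` to be copied by hand. A minimal sketch of capturing it from the submit response might look like the following. The function names `extract_task_id` and `submit_task` are illustrative and not part of the skill, and the `sed` pattern is a naive stand-in for a real JSON parser such as `jq`. It assumes the response carries a single `"id"` string, as shown in the API Reference.

```bash
# Hypothetical helpers, not part of the skill.

# Pull the "id" string out of a create-task response.
# Naive: assumes a single "id" key; prefer a JSON parser such as jq if available.
extract_task_id() {
  printf '%s\n' "$1" | sed -n 's/.*"id"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p'
}

# Submit a task and print only its id.
# The task text is inserted verbatim, so it must not contain
# double quotes or backslashes in this simplified version.
submit_task() {
  response=$(curl -s -X POST https://api.browser-use.com/api/v2/tasks \
    -H "X-Browser-Use-API-Key: $BROWSER_USE_API_KEY" \
    -H "Content-Type: application/json" \
    -d "{\"task\": \"$1\"}")
  extract_task_id "$response"
}

# Usage (makes a real API call, so it is left commented out):
# TASK_ID=$(submit_task "Go to example.com and extract the main heading")
# curl -s "https://api.browser-use.com/api/v2/tasks/$TASK_ID" \
#   -H "X-Browser-Use-API-Key: $BROWSER_USE_API_KEY"
```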

## Helper Script

Use `scripts/browser-use.sh` for simpler execution:

```bash
# Run task and wait for result
./scripts/browser-use.sh "Go to hacker news and get the top 3 stories"

# Just submit (don't wait)
./scripts/browser-use.sh --no-wait "Search Google for AI news"
```

## API Reference

### Create Task
```
POST https://api.browser-use.com/api/v2/tasks
```

Body (`llm` is optional; the default is a fast model):
```json
{
  "task": "Plain English description of what to do",
  "llm": "gemini-3-flash-preview"
}
```
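
Task descriptions often contain double quotes, which break a hand-written `-d '{...}'` string. One possible way to build the request body in the shell is sketched below. `json_escape` and `build_task_body` are hypothetical helper names, and the escaping only covers backslashes and double quotes; it is not a complete JSON encoder.

```bash
# Hypothetical helpers, not part of the skill.

# Escape backslashes and double quotes so text is safe inside a JSON string.
# Sketch only: newlines and other control characters are not handled.
json_escape() {
  printf '%s' "$1" | sed 's/\\/\\\\/g; s/"/\\"/g'
}

# Build the create-task body. The second argument (llm) is optional.
build_task_body() {
  task=$(json_escape "$1")
  if [ -n "${2:-}" ]; then
    printf '{"task": "%s", "llm": "%s"}\n' "$task" "$(json_escape "$2")"
  else
    printf '{"task": "%s"}\n' "$task"
  fi
}
```

The result can be passed to curl as `-d "$(build_task_body 'Search for "AI news"')"`.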

Response:
```json
{
  "id": "task-uuid",
  "sessionId": "session-uuid"
}
```

### Get Task Status
```
GET https://api.browser-use.com/api/v2/tasks/{taskId}
```

Response fields:
- `status`: `pending` | `started` | `finished` | `failed`
- `output`: Result text when finished
- `steps`: Array of actions taken (with screenshots)
- `cost`: Cost in dollars (e.g., "0.02")
- `isSuccess`: Boolean indicating whether the task succeeded
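
The status values above suggest a simple polling loop: keep fetching the task until it reaches `finished` or `failed`. The sketch below is one way to write it. `task_status` and `wait_for_task` are hypothetical names, the interval and poll limit are assumed defaults rather than documented values, and the `sed` extraction is naive (a JSON parser such as `jq` would be more robust, especially if `steps` entries carry their own `status` keys).

```bash
# Hypothetical helpers, not part of the skill.

# Extract a "status" string from a task response (naive, see note above).
task_status() {
  printf '%s\n' "$1" \
    | sed -n 's/.*"status"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p' \
    | head -n 1
}

# Poll a task until it finishes, fails, or the poll limit is reached.
# Returns 0 on finished, 1 on failed, 2 on timeout.
wait_for_task() {
  task_id=$1
  interval=${2:-2}      # seconds between polls (assumed default)
  max_polls=${3:-150}   # give up after this many polls (assumed default)
  n=0
  while [ "$n" -lt "$max_polls" ]; do
    response=$(curl -s "https://api.browser-use.com/api/v2/tasks/$task_id" \
      -H "X-Browser-Use-API-Key: $BROWSER_USE_API_KEY")
    case $(task_status "$response") in
      finished) printf '%s\n' "$response"; return 0 ;;
      failed)   printf '%s\n' "$response" >&2; return 1 ;;
    esac
    n=$((n + 1))
    sleep "$interval"
  done
  echo "timed out waiting for task $task_id" >&2
  return 2
}
```

On success the full response is printed to standard output, so `output`, `cost`, and `isSuccess` can be read from it afterwards.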

### Stop Task
```
POST https://api.browser-use.com/api/v2/tasks/{taskId}/stop
```
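
For symmetry with the other endpoints, a stop request could be wrapped as shown below. `stop_task` is a hypothetical name, and the sketch assumes the endpoint needs only the API key header and no request body, which the source does not state either way.

```bash
# Hypothetical helper, not part of the skill.
# Ask the API to stop a running task, given its id.
# Assumes no request body is required.
stop_task() {
  curl -s -X POST "https://api.browser-use.com/api/v2/tasks/$1/stop" \
    -H "X-Browser-Use-API-Key: $BROWSER_USE_API_KEY"
}

# Usage (makes a real API call, so it is left commented out):
# stop_task "$TASK_ID"
```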

## Pricing

~$0.01-0.05 per task depending on complexity. Check balance:
```bash
curl -s https://api.browser-use.com/api/v2/credits \
  -H "X-Browser-Use-API-Key: $BROWSER_USE_API_KEY"
```
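
As a rough budgeting aid, the per-task range above can be multiplied out for a batch. This is plain arithmetic on the approximate $0.01 to $0.05 figures, not a quote from the API, and `estimate_cost` is a hypothetical name.

```bash
# Hypothetical helper, not part of the skill.
# Print the low and high estimated spend in dollars for N tasks,
# using the approximate $0.01-0.05 per-task range.
estimate_cost() {
  LC_ALL=C awk -v n="$1" 'BEGIN { printf "%.2f %.2f\n", n * 0.01, n * 0.05 }'
}
```

For example, `estimate_cost 200` prints `2.00 10.00`, meaning roughly $2 to $10 for 200 tasks.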

## When to Use

- Complex multi-step web workflows
- Sites that block simple scraping
- Form filling and submissions
- When you need screenshots of steps
- When local browser control isn't available

## When NOT to Use

- Simple page fetches (use `web_fetch` instead)
- When you have local browser access (use `browser` tool)
- Rapid/high-volume scraping (use Code Use or local scraping)
