clawrouter

Smart LLM router — save 67% on inference costs. Routes every request to the cheapest capable model across 41 models from OpenAI, Anthropic, Google, DeepSeek, and xAI.

3,891 stars
Complexity: easy

About this skill

ClawRouter is an essential `openclaw` skill designed to optimize LLM inference costs by intelligently routing each request to the most cost-effective yet capable model. It supports a diverse ecosystem of 41 models from leading providers including OpenAI, Anthropic, Google, DeepSeek, and xAI, all accessible through a unified interface. Users can expect savings of up to 67% on their LLM expenses, making high-volume AI interactions more economical. The skill operates by classifying each incoming request into one of four complexity tiers: SIMPLE, MEDIUM, COMPLEX, or REASONING. Based on this classification, it then directs the request to the cheapest model best suited for that specific task. For example, simple queries might go to Gemini Flash for maximum savings, while complex code generation could be routed to Claude Opus for optimal quality. Most routing decisions are made swiftly by rules (<1ms), with only ambiguous queries requiring a minimal-cost LLM classifier. Beyond automated cost optimization, ClawRouter offers flexibility, allowing users to enable smart routing globally or to pin a specific model when precise control is needed. This combination of intelligent automation and user-defined control ensures that both cost efficiency and task-specific performance requirements are met, streamlining LLM usage for a wide range of applications.

Best use case

ClawRouter is ideal for developers, businesses, and individual users who frequently interact with various LLMs and are looking to drastically cut down on their inference expenses. It's particularly beneficial for those managing high volumes of diverse AI tasks where model selection can significantly impact operational costs, allowing them to optimize expenditure without compromising on quality or performance.

Smart LLM router — save 67% on inference costs. Routes every request to the cheapest capable model across 41 models from OpenAI, Anthropic, Google, DeepSeek, and xAI.

Users should expect a substantial reduction in their overall LLM inference expenditures, coupled with intelligent model selection tailored to the complexity of each request.

Practical example

Example input

openclaw chat "Write a 50-word summary of the American Civil War's causes and main outcome."

Example output

[ClawRouter] google/gemini-2.5-flash (SIMPLE, rules, confidence=0.92)
             Cost: $0.0025 | Baseline: $0.308 | Saved: 99.2%

When to use this skill

  • To significantly reduce your overall LLM inference costs.
  • When needing to access a wide array of LLM models from different providers through a single interface.
  • For automatically selecting the most appropriate and cost-effective model for a given task's complexity.
  • If you want to simplify LLM provider management and billing into one 'wallet' via openclaw.

When not to use this skill

  • If you exclusively use a single, specific LLM and have no interest in cost optimization or alternative models.
  • If your LLM usage is extremely low, making the potential cost savings negligible.
  • If you require absolute, manual control over every single model invocation without any automated routing.
  • If you prefer to manage direct API keys and accounts for each individual LLM provider.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/clawrouter/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/1bcmax/clawrouter/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/clawrouter/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How clawrouter Compares

Feature / AgentclawrouterStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityeasyN/A

Frequently Asked Questions

What does this skill do?

Smart LLM router — save 67% on inference costs. Routes every request to the cheapest capable model across 41 models from OpenAI, Anthropic, Google, DeepSeek, and xAI.

How difficult is it to install?

The installation complexity is rated as easy. You can find the installation instructions above.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

SKILL.md Source

# ClawRouter

Smart LLM router that saves 67% on inference costs by routing each request to the cheapest model that can handle it. 41 models across 5 providers, all through one wallet.

## Install

```bash
openclaw plugins install @blockrun/clawrouter
```

## Setup

```bash
# Enable smart routing (auto-picks cheapest model per request)
openclaw models set blockrun/auto

# Or pin a specific model
openclaw models set openai/gpt-4o
```

## How Routing Works

ClawRouter classifies each request into one of four tiers:

- **SIMPLE** (40% of traffic) — factual lookups, greetings, translations → Gemini Flash ($0.60/M, 99% savings)
- **MEDIUM** (30%) — summaries, explanations, data extraction → DeepSeek Chat ($0.42/M, 99% savings)
- **COMPLEX** (20%) — code generation, multi-step analysis → Claude Opus ($75/M, best quality)
- **REASONING** (10%) — proofs, formal logic, multi-step math → o3 ($8/M, 89% savings)

Rules handle ~80% of requests in <1ms. Only ambiguous queries hit the LLM classifier (~$0.00003 per classification).

## Available Models

41 models including: gpt-5.2, gpt-4o, gpt-4o-mini, o3, o1, claude-opus-4.6, claude-sonnet-4.6, claude-haiku-4.5, gemini-3.1-pro, gemini-2.5-pro, gemini-2.5-flash, gemini-2.5-flash-lite, deepseek-chat, deepseek-reasoner, grok-3, grok-3-mini.

## Example Output

```
[ClawRouter] google/gemini-2.5-flash (SIMPLE, rules, confidence=0.92)
             Cost: $0.0025 | Baseline: $0.308 | Saved: 99.2%
```

Related Skills

---

3891
from openclaw/skills

name: article-factory-wechat

Content & Documentation

humanizer

3891
from openclaw/skills

Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comprehensive "Signs of AI writing" guide. Detects and fixes patterns including: inflated symbolism, promotional language, superficial -ing analyses, vague attributions, em dash overuse, rule of three, AI vocabulary words, negative parallelisms, and excessive conjunctive phrases.

Content & Documentation

find-skills

3891
from openclaw/skills

Helps users discover and install agent skills when they ask questions like "how do I do X", "find a skill for X", "is there a skill that can...", or express interest in extending capabilities. This skill should be used when the user is looking for functionality that might exist as an installable skill.

General Utilities

tavily-search

3891
from openclaw/skills

Use Tavily API for real-time web search and content extraction. Use when: user needs real-time web search results, research, or current information from the web. Requires Tavily API key.

Data & Research

baidu-search

3891
from openclaw/skills

Search the web using Baidu AI Search Engine (BDSE). Use for live information, documentation, or research topics.

Data & Research

agent-autonomy-kit

3891
from openclaw/skills

Stop waiting for prompts. Keep working.

Workflow & Productivity

Meeting Prep

3891
from openclaw/skills

Never walk into a meeting unprepared again. Your agent researches all attendees before calendar events—pulling LinkedIn profiles, recent company news, mutual connections, and conversation starters. Generates a briefing doc with talking points, icebreakers, and context so you show up informed and confident. Triggered automatically before meetings or on-demand. Configure research depth, advance timing, and output format. Walking into meetings blind is amateur hour—missed connections, generic small talk, zero leverage. Use when setting up meeting intelligence, researching specific attendees, generating pre-meeting briefs, or automating your prep workflow.

Workflow & Productivity

self-improvement

3891
from openclaw/skills

Captures learnings, errors, and corrections to enable continuous improvement. Use when: (1) A command or operation fails unexpectedly, (2) User corrects Claude ('No, that's wrong...', 'Actually...'), (3) User requests a capability that doesn't exist, (4) An external API or tool fails, (5) Claude realizes its knowledge is outdated or incorrect, (6) A better approach is discovered for a recurring task. Also review learnings before major tasks.

Agent Intelligence & Learning

botlearn-healthcheck

3891
from openclaw/skills

botlearn-healthcheck — BotLearn autonomous health inspector for OpenClaw instances across 5 domains (hardware, config, security, skills, autonomy); triggers on system check, health report, diagnostics, or scheduled heartbeat inspection.

DevOps & Infrastructure

linkedin-cli

3891
from openclaw/skills

A bird-like LinkedIn CLI for searching profiles, checking messages, and summarizing your feed using session cookies.

Content & Documentation

notebooklm

3891
from openclaw/skills

Google NotebookLM 非官方 Python API 的 OpenClaw Skill。支持内容生成(播客、视频、幻灯片、测验、思维导图等)、文档管理和研究自动化。当用户需要使用 NotebookLM 生成音频概述、视频、学习材料或管理知识库时触发。

Data & Research

小红书长图文发布 Skill

3891
from openclaw/skills

## 概述

Content & Documentation