openrouter-model-routing
Implement intelligent model routing to optimize cost, quality, and latency on OpenRouter. Use when building multi-model systems or optimizing spend across task types. Triggers: 'openrouter routing', 'model routing', 'route to model', 'model selection openrouter'.
Best use case
openrouter-model-routing is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Implement intelligent model routing to optimize cost, quality, and latency on OpenRouter. Use when building multi-model systems or optimizing spend across task types. Triggers: 'openrouter routing', 'model routing', 'route to model', 'model selection openrouter'.
Teams using openrouter-model-routing should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/openrouter-model-routing/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How openrouter-model-routing Compares
| Feature / Agent | openrouter-model-routing | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Implement intelligent model routing to optimize cost, quality, and latency on OpenRouter. Use when building multi-model systems or optimizing spend across task types. Triggers: 'openrouter routing', 'model routing', 'route to model', 'model selection openrouter'.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
AI Agents for Coding
Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.
Best AI Skills for Claude
Explore the best AI skills for Claude and Claude Code across coding, research, workflow automation, documentation, and agent operations.
ChatGPT vs Claude for Agent Skills
Compare ChatGPT and Claude for AI agent skills across coding, writing, research, and reusable workflow execution.
SKILL.md Source
# OpenRouter Model Routing
## Overview
OpenRouter gives you access to 100+ models through one API. The key to cost efficiency is routing each request to the right model based on task complexity, required capabilities, cost budget, and latency requirements. This skill covers task-based routing, complexity classification, cost-aware selection, and OpenRouter's native routing features.
## Task-Based Router
```python
import os, re
from openai import OpenAI
client = OpenAI(
base_url="https://openrouter.ai/api/v1",
api_key=os.environ["OPENROUTER_API_KEY"],
default_headers={"HTTP-Referer": "https://my-app.com", "X-Title": "my-app"},
)
# Model tiers by cost and capability
MODELS = {
"free": "google/gemma-2-9b-it:free", # $0/0 — testing only
"budget": "meta-llama/llama-3.1-8b-instruct", # $0.06/$0.06 per 1M
"mid": "openai/gpt-4o-mini", # $0.15/$0.60 per 1M
"standard":"anthropic/claude-3.5-sonnet", # $3/$15 per 1M
"premium": "openai/o1", # $15/$60 per 1M
}
TASK_ROUTING = {
"classification": "budget", # Simple label assignment
"translation": "mid", # Moderate quality needed
"summarization": "mid", # Good quality, cost-effective
"code_generation": "standard", # Needs high accuracy
"code_review": "standard", # Needs reasoning
"analysis": "standard", # Complex reasoning
"creative_writing":"standard", # Quality matters
"deep_reasoning": "premium", # Multi-step logic
"simple_qa": "budget", # Basic questions
"chat": "mid", # General conversation
}
def route_request(task_type: str, messages: list[dict], **kwargs) -> dict:
"""Route to appropriate model based on task type."""
tier = TASK_ROUTING.get(task_type, "mid")
model = MODELS[tier]
response = client.chat.completions.create(
model=model, messages=messages, **kwargs
)
return {
"content": response.choices[0].message.content,
"model": response.model,
"tier": tier,
"tokens": response.usage.prompt_tokens + response.usage.completion_tokens,
}
```
## Complexity-Based Auto-Router
```python
def classify_complexity(prompt: str) -> str:
"""Classify prompt complexity to select model tier.
Simple heuristics -- replace with a trained classifier for production.
"""
word_count = len(prompt.split())
has_code = bool(re.search(r'```|def |function |class |import ', prompt))
has_reasoning = bool(re.search(r'explain|analyze|compare|why|how does|trade.?off', prompt, re.I))
has_math = bool(re.search(r'calculate|equation|formula|derive|proof', prompt, re.I))
if has_math or (has_reasoning and has_code):
return "premium"
if has_code or has_reasoning or word_count > 500:
return "standard"
if word_count > 100:
return "mid"
return "budget"
def auto_route(messages: list[dict], **kwargs):
"""Automatically select model based on prompt complexity."""
user_msg = next((m["content"] for m in reversed(messages) if m["role"] == "user"), "")
tier = classify_complexity(user_msg)
model = MODELS[tier]
response = client.chat.completions.create(model=model, messages=messages, **kwargs)
return response
```
## OpenRouter Native Routing
```python
# Route: "fallback" — try models in order until one succeeds
response = client.chat.completions.create(
model="anthropic/claude-3.5-sonnet",
messages=[{"role": "user", "content": "Hello"}],
max_tokens=200,
extra_body={
"models": [
"anthropic/claude-3.5-sonnet",
"openai/gpt-4o",
"openai/gpt-4o-mini",
],
"route": "fallback",
},
)
# Provider routing — control which provider serves a model
response = client.chat.completions.create(
model="anthropic/claude-3.5-sonnet",
messages=[{"role": "user", "content": "Hello"}],
max_tokens=200,
extra_body={
"provider": {
"order": ["Anthropic", "AWS Bedrock"],
"allow_fallbacks": True,
},
},
)
# Model variant: ":floor" picks cheapest provider
response = client.chat.completions.create(
model="anthropic/claude-3.5-sonnet:floor",
messages=[{"role": "user", "content": "Hello"}],
max_tokens=200,
)
```
## Cost-Aware Router
```python
import requests
def get_model_pricing() -> dict:
"""Fetch current pricing for cost-aware routing."""
models = requests.get("https://openrouter.ai/api/v1/models").json()["data"]
return {
m["id"]: {
"prompt": float(m["pricing"]["prompt"]) * 1_000_000,
"completion": float(m["pricing"]["completion"]) * 1_000_000,
"context": m["context_length"],
}
for m in models
}
def cheapest_model_for_task(pricing: dict, min_context: int = 4096,
needs_tools: bool = False) -> str:
"""Find the cheapest model that meets requirements."""
candidates = [
(mid, p) for mid, p in pricing.items()
if p["context"] >= min_context and p["prompt"] > 0 # Exclude free (unreliable)
]
candidates.sort(key=lambda x: x[1]["prompt"] + x[1]["completion"])
return candidates[0][0] if candidates else "openai/gpt-4o-mini"
```
## Error Handling
| Error | Cause | Fix |
|-------|-------|-----|
| Wrong model selected | Classification too coarse | Add more task categories; test with diverse prompts |
| Model unavailable | Selected model temporarily down | Add fallback chain per tier |
| Cost overrun | Complex tasks routed to premium models | Set `max_tokens` and daily budget caps |
| Quality regression | Budget model can't handle task | Monitor output quality; escalate tier on poor results |
## Enterprise Considerations
- Start with manual task-type routing (explicit labels), then graduate to auto-classification
- Log every routing decision (task type, tier, model, cost) to tune the router over time
- Use OpenRouter's `:floor` variant to automatically get the cheapest provider for any model
- Set `max_tokens` on every request to cap per-request cost regardless of model tier
- A/B test routing rules: send 10% of traffic to a different tier and compare quality metrics
- Combine with fallback chains so each tier has backup models
## References
- [Examples](${CLAUDE_SKILL_DIR}/references/examples.md) | [Errors](${CLAUDE_SKILL_DIR}/references/errors.md)
- [Model Routing](https://openrouter.ai/docs/features/model-routing) | [Provider Routing](https://openrouter.ai/docs/features/provider-routing)Related Skills
openrouter-usage-analytics
Track and analyze OpenRouter API usage patterns, costs, and performance. Use when building dashboards, optimizing spend, or reporting on AI usage. Triggers: 'openrouter analytics', 'openrouter usage', 'openrouter metrics', 'track openrouter spend'.
openrouter-upgrade-migration
Migrate to OpenRouter from direct provider APIs or upgrade between SDK/model versions. Triggers: 'openrouter migrate', 'openrouter upgrade', 'switch to openrouter', 'migrate from openai to openrouter'.
openrouter-team-setup
Configure OpenRouter for multi-user teams with per-user keys, budget controls, and usage attribution. Triggers: 'openrouter team', 'openrouter multi-user', 'openrouter organization', 'team api keys openrouter'.
openrouter-routing-rules
Define custom routing rules for OpenRouter requests based on user tier, task type, cost budget, and availability. Triggers: 'openrouter rules', 'routing rules', 'custom routing openrouter', 'conditional model selection'.
openrouter-reference-architecture
Design production architectures using OpenRouter as the LLM gateway. Use when planning system design, reviewing architecture, or scaling AI applications. Triggers: 'openrouter architecture', 'openrouter system design', 'openrouter at scale', 'llm gateway architecture'.
openrouter-rate-limits
Understand and handle OpenRouter rate limits. Use when hitting 429 errors, building high-throughput systems, or implementing retry logic. Triggers: 'openrouter rate limit', 'openrouter 429', 'openrouter throttle', 'rate limiting openrouter'.
openrouter-prod-checklist
Validate production readiness of your OpenRouter integration. Use before launching to production or during operational reviews. Triggers: 'openrouter production', 'openrouter launch', 'production checklist openrouter', 'openrouter deploy'.
openrouter-pricing-basics
Understand OpenRouter pricing, calculate costs, and optimize spend. Use when budgeting, comparing model costs, or tracking spend. Triggers: 'openrouter pricing', 'openrouter cost', 'model pricing', 'openrouter budget', 'how much does openrouter cost'.
openrouter-performance-tuning
Optimize OpenRouter request latency and throughput. Use when building real-time applications, reducing TTFT, or scaling request volume. Triggers: 'openrouter performance', 'openrouter latency', 'openrouter speed', 'optimize openrouter throughput'.
openrouter-openai-compat
Migrate from OpenAI to OpenRouter with minimal code changes. Use when switching to OpenRouter or maintaining dual compatibility. Triggers: 'openrouter openai compatible', 'openrouter drop-in', 'openai to openrouter', 'openrouter migration'.
openrouter-multi-provider
Use multiple AI providers (OpenAI, Anthropic, Google, Meta) through OpenRouter's unified API. Use when comparing providers, building cross-provider workflows, or maximizing availability. Triggers: 'openrouter providers', 'multi provider', 'openrouter openai anthropic', 'compare models openrouter'.
openrouter-model-catalog
Query, filter, and select from OpenRouter's 400+ model catalog. Use when choosing models, comparing pricing, or checking capabilities. Triggers: 'openrouter models', 'list models', 'model catalog', 'compare models', 'available models'.