OpenAI Agents SDK — Build Production AI Agents
You are an expert in the OpenAI Agents SDK (formerly Swarm), the official framework for building multi-agent systems. You help developers create agents with tool calling, guardrails, agent handoffs, streaming, tracing, and MCP integration — building production-grade AI agents that coordinate, delegate tasks, and execute tools with built-in safety controls.
Best use case
OpenAI Agents SDK — Build Production AI Agents is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
You are an expert in the OpenAI Agents SDK (formerly Swarm), the official framework for building multi-agent systems. You help developers create agents with tool calling, guardrails, agent handoffs, streaming, tracing, and MCP integration — building production-grade AI agents that coordinate, delegate tasks, and execute tools with built-in safety controls.
Teams using OpenAI Agents SDK — Build Production AI Agents should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/openai-agents/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How OpenAI Agents SDK — Build Production AI Agents Compares
| Feature / Agent | OpenAI Agents SDK — Build Production AI Agents | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
You are an expert in the OpenAI Agents SDK (formerly Swarm), the official framework for building multi-agent systems. You help developers create agents with tool calling, guardrails, agent handoffs, streaming, tracing, and MCP integration — building production-grade AI agents that coordinate, delegate tasks, and execute tools with built-in safety controls.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# OpenAI Agents SDK — Build Production AI Agents
You are an expert in the OpenAI Agents SDK (formerly Swarm), the official framework for building multi-agent systems. You help developers create agents with tool calling, guardrails, agent handoffs, streaming, tracing, and MCP integration — building production-grade AI agents that coordinate, delegate tasks, and execute tools with built-in safety controls.
## Core Capabilities
### Agent Definition
```python
# agents/customer_support.py — Multi-agent customer support system
from agents import Agent, Runner, function_tool, GuardrailFunctionOutput, InputGuardrail
from pydantic import BaseModel
class OrderInfo(BaseModel):
order_id: str
status: str
total: float
items: list[str]
@function_tool
async def lookup_order(order_id: str) -> OrderInfo:
"""Look up an order by ID.
Args:
order_id: The order identifier (e.g., ORD-12345)
"""
order = await db.orders.find_by_id(order_id)
return OrderInfo(
order_id=order.id,
status=order.status,
total=order.total,
items=[item.name for item in order.items],
)
@function_tool
async def initiate_refund(order_id: str, reason: str) -> str:
"""Initiate a refund for an order.
Args:
order_id: The order to refund
reason: Reason for the refund
"""
result = await payments.refund(order_id, reason)
return f"Refund initiated: ${result.amount}. Reference: {result.reference_id}"
@function_tool
async def escalate_to_human(summary: str) -> str:
"""Escalate to a human agent when the issue is too complex.
Args:
summary: Brief summary of the issue for the human agent
"""
ticket = await support.create_ticket(summary, priority="high")
return f"Escalated to human agent. Ticket: {ticket.id}"
# Triage agent — routes to the right specialist
triage_agent = Agent(
name="Triage",
instructions="""You are a customer support triage agent.
Determine the customer's issue and hand off to the appropriate specialist:
- Order issues → Order Specialist
- Billing/refund → Billing Specialist
- Technical problems → escalate to human""",
handoffs=["order_specialist", "billing_specialist"],
tools=[escalate_to_human],
)
# Specialist agents
order_specialist = Agent(
name="Order Specialist",
instructions="You handle order-related inquiries. Look up orders, provide status updates, and help with modifications.",
tools=[lookup_order],
handoffs=["billing_specialist"], # Can hand off to billing if needed
)
billing_specialist = Agent(
name="Billing Specialist",
instructions="You handle billing and refund requests. Verify orders before processing refunds. Maximum refund without approval: $500.",
tools=[lookup_order, initiate_refund],
)
```
### Guardrails
```python
# Input guardrail — runs before the agent processes the message
class ContentCheck(BaseModel):
is_appropriate: bool
reasoning: str
async def content_guardrail(ctx, agent, input) -> GuardrailFunctionOutput:
"""Check if user input is appropriate before processing."""
result = await Runner.run(
Agent(
name="Content Checker",
instructions="Check if the input is a legitimate customer support request. Flag inappropriate content.",
output_type=ContentCheck,
),
input,
context=ctx,
)
return GuardrailFunctionOutput(
output_info=result.final_output,
tripwire_triggered=not result.final_output.is_appropriate,
)
triage_agent = Agent(
name="Triage",
instructions="...",
input_guardrails=[InputGuardrail(guardrail_function=content_guardrail)],
handoffs=["order_specialist", "billing_specialist"],
)
```
### Running Agents
```python
from agents import Runner
# Single turn
result = await Runner.run(
triage_agent,
"I want a refund for order ORD-12345, the product arrived damaged",
)
print(result.final_output)
# Agent flow: Triage → Billing Specialist → lookup_order → initiate_refund
# Streaming
async for event in Runner.run_streamed(triage_agent, user_message):
if event.type == "raw_response_event":
if hasattr(event.data, "delta"):
print(event.data.delta, end="")
elif event.type == "agent_updated_stream_event":
print(f"\n[Handed off to: {event.new_agent.name}]")
elif event.type == "tool_call_event":
print(f"\n[Calling tool: {event.tool_name}]")
# With MCP servers
from agents.mcp import MCPServerStdio
async with MCPServerStdio(command="npx", args=["-y", "@modelcontextprotocol/server-filesystem", "/data"]) as mcp:
agent = Agent(
name="File Assistant",
instructions="Help users manage files",
mcp_servers=[mcp],
)
result = await Runner.run(agent, "List all Python files in /data")
```
## Installation
```bash
pip install openai-agents
```
## Best Practices
1. **Triage + specialists** — Use a triage agent for routing; specialist agents for domain-specific tasks
2. **Guardrails** — Add input/output guardrails for content filtering, PII detection, policy enforcement
3. **Handoffs** — Use handoffs for agent delegation; cheaper than one mega-agent with all tools
4. **Structured output** — Use `output_type` with Pydantic models for typed, validated agent responses
5. **Tool design** — Make tools focused (one action each); clear docstrings help the agent use them correctly
6. **Tracing** — Enable tracing for debugging agent decisions, tool calls, and handoff chains
7. **MCP integration** — Connect MCP servers for file access, database queries, API calls without custom tools
8. **Streaming** — Use `run_streamed` for real-time output; show tool calls and handoffs to users for transparencyRelated Skills
vertex-agent-builder
Build and deploy production-ready generative AI agents using Vertex AI, Gemini models, and Google Cloud infrastructure with RAG, function calling, and multi-modal capabilities
test-data-builder
Test Data Builder - Auto-activating skill for Test Automation. Triggers on: test data builder, test data builder Part of the Test Automation skill category.
building-terraform-modules
This skill empowers Claude to build reusable Terraform modules based on user specifications. It leverages the terraform-module-builder plugin to generate production-ready, well-documented Terraform module code, incorporating best practices for security, scalability, and multi-platform support. Use this skill when the user requests to create a new Terraform module, generate Terraform configuration, or needs help structuring infrastructure as code using Terraform. The trigger terms include "create Terraform module," "generate Terraform configuration," "Terraform module code," and "infrastructure as code."
sklearn-pipeline-builder
Sklearn Pipeline Builder - Auto-activating skill for ML Training. Triggers on: sklearn pipeline builder, sklearn pipeline builder Part of the ML Training skill category.
sam-template-builder
Sam Template Builder - Auto-activating skill for AWS Skills. Triggers on: sam template builder, sam template builder Part of the AWS Skills skill category.
building-recommendation-systems
This skill empowers Claude to construct recommendation systems using collaborative filtering, content-based filtering, or hybrid approaches. It analyzes user preferences, item features, and interaction data to generate personalized recommendations. Use this skill when the user requests to build a recommendation engine, needs help with collaborative filtering, wants to implement content-based filtering, or seeks to rank items based on relevance for a specific user or group of users. It is triggered by requests involving "recommendations", "collaborative filtering", "content-based filtering", "ranking items", or "building a recommender".
prefect-flow-builder
Prefect Flow Builder - Auto-activating skill for Data Pipelines. Triggers on: prefect flow builder, prefect flow builder Part of the Data Pipelines skill category.
building-neural-networks
This skill allows Claude to construct and configure neural network architectures using the neural-network-builder plugin. It should be used when the user requests the creation of a new neural network, modification of an existing one, or assistance with defining the layers, parameters, and training process. The skill is triggered by requests involving terms like "build a neural network," "define network architecture," "configure layers," or specific mentions of neural network types (e.g., "CNN," "RNN," "transformer").
graphql-mutation-builder
Graphql Mutation Builder - Auto-activating skill for API Development. Triggers on: graphql mutation builder, graphql mutation builder Part of the API Development skill category.
building-gitops-workflows
This skill enables Claude to construct GitOps workflows using ArgoCD and Flux. It is designed to generate production-ready configurations, implement best practices, and ensure a security-first approach for Kubernetes deployments. Use this skill when the user explicitly requests "GitOps workflow", "ArgoCD", "Flux", or asks for help with setting up a continuous delivery pipeline using GitOps principles. The skill will generate the necessary configuration files and setup code based on the user's specific requirements and infrastructure.
genkit-production-expert
Build production Firebase Genkit applications including RAG systems, multi-step flows, and tool calling for Node.js/Python/Go. Deploy to Firebase Functions or Cloud Run with AI monitoring. Use when asked to "create genkit flow" or "implement RAG". Trigger with relevant phrases based on skill purpose.
funnel-analysis-builder
Funnel Analysis Builder - Auto-activating skill for Data Analytics. Triggers on: funnel analysis builder, funnel analysis builder Part of the Data Analytics skill category.