agent-tester

Test agent: dry-run, unit, integration, compatibility

33 stars

byaAAaqwq

View on GitHub Installation ↓

Best use case

agent-tester is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Test agent: dry-run, unit, integration, compatibility

Teams using agent-tester should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/agent-tester/SKILL.md --create-dirs "https://raw.githubusercontent.com/aAAaqwq/AGI-Super-Team/main/skills/agent-tester/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/agent-tester/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How agent-tester Compares

Feature / Agent	agent-tester	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Test agent: dry-run, unit, integration, compatibility

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Agent Tester

> Tests a built agent: dry-run, unit tests, integration, compatibility with other agents.

## When to use

- After Agent Builder has finished
- "test agent X"
- "check agent compatibility"

## Input

- Agent from `$AGENTS_PATH/[name]/`
- Spec from `$AGENTS_PATH/specs/[name].spec.md`

## How to execute

### Step 1: Static analysis

Check the agent code:

- [ ] File exists and runs without syntax errors
- [ ] All imports resolve
- [ ] Config file is valid
- [ ] Paths in config exist
- [ ] Credentials are accessible
- [ ] Dry-run mode is implemented

### Step 2: Dry-run test

Run the agent with `--dry-run`:

```bash
python3 $AGENTS_PATH/[name]/[name]_agent.py --dry-run
```

Check:
- [ ] Agent starts without errors
- [ ] Logs are clear
- [ ] Shows what it WOULD do (without real side effects)
- [ ] Execution time is reasonable

### Step 3: Unit tests

Run tests:

```bash
python3 -m pytest $AGENTS_PATH/[name]/test_[name].py -v
```

Minimum tests:
- [ ] Input parsing works
- [ ] Business logic is correct on test data
- [ ] Error handling works (bad input, missing files, API timeout)
- [ ] Output format is correct

### Step 4: Integration test (one run on real data)

**WARNING: only with human approval!**

1. Back up data that the agent modifies:
```bash
cp [target.csv] [target.csv.backup]
```

2. Run the agent once on real data
3. Check output:
   - [ ] Data was written correctly
   - [ ] Format matches schema.yaml
   - [ ] Nothing broke
   - [ ] Git commit was created (if needed)

4. If something is wrong -- rollback:
```bash
cp [target.csv.backup] [target.csv]
```

### Step 5: Compatibility test

Check that the new agent does not conflict with existing ones:

```markdown
## Compatibility Matrix

| Agent | Shared Files | Potential Conflict | Status |
|-------|-------------|-------------------|--------|
| Email Pipeline | activities.csv | Write conflict | ? |
| [other agents] | ... | ... | ? |
```

Specific checks:
- [ ] **File locks**: can two agents write to the same CSV simultaneously
- [ ] **Data consistency**: does the agent overwrite another agent's data
- [ ] **ID generation**: do IDs conflict (person_id, activity_id, etc.)
- [ ] **Schedule overlap**: do agents run at the same time
- [ ] **Git conflicts**: does auto-commit create merge conflicts

### Step 6: Report

Create a test report file:

```
$AGENTS_PATH/specs/[name].test-report.md
```

**Report structure:**

```markdown
# Test Report: [Agent Name]

## Date: YYYY-MM-DD
## Tester: Process Analyst Agent

## Results

| Test | Status | Notes |
|------|--------|-------|
| Static analysis | PASS/FAIL | |
| Dry-run | PASS/FAIL | |
| Unit tests | PASS/FAIL | X/Y passed |
| Integration | PASS/FAIL | |
| Compatibility | PASS/FAIL | |

## Issues Found
1. [Issue description + severity]

## Recommendation
- [ ] READY for production
- [ ] NEEDS FIXES (list what)
- [ ] BLOCKED (list why)
```

## Output

- Test report in `$AGENTS_PATH/specs/[name].test-report.md`
- PASS/FAIL verdict
- List of issues if any

## Related skills

- `process-analyst` — creates the spec
- `agent-builder` — builds the agent
- `change-review` — validates CRM/PM changes

Related Skills

wemp-operator

from aAAaqwq/AGI-Super-Team

> 微信公众号全功能运营——草稿/发布/评论/用户/素材/群发/统计/菜单/二维码 API 封装

Content & Documentation

zsxq-smart-publish

from aAAaqwq/AGI-Super-Team

Publish and manage content on 知识星球 (zsxq.com). Supports talk posts, Q&A, long articles, file sharing, digest/bookmark, homework tasks, and tag management. Use when publishing content to 知识星球, creating/editing posts, uploading files/images/audio, managing digests, batch publishing, or formatting content for 知识星球.

zoom-automation

from aAAaqwq/AGI-Super-Team

Automate Zoom meeting creation, management, recordings, webinars, and participant tracking via Rube MCP (Composio). Always search tools first for current schemas.

zoho-crm-automation

from aAAaqwq/AGI-Super-Team

Automate Zoho CRM tasks via Rube MCP (Composio): create/update records, search contacts, manage leads, and convert leads. Always search tools first for current schemas.

ziliu-publisher

from aAAaqwq/AGI-Super-Team

字流(Ziliu) - AI驱动的多平台内容分发工具。用于一次创作、智能适配排版、一键分发到16+平台（公众号/知乎/小红书/B站/抖音/微博/X等）。当用户需要多平台发布、内容排版、格式适配时使用。触发词：字流、ziliu、多平台发布、一键分发、内容分发、排版发布。

zhihu-post-skill

from aAAaqwq/AGI-Super-Team

> 知乎文章发布——知乎平台内容创作与发布自动化

zendesk-automation

from aAAaqwq/AGI-Super-Team

Automate Zendesk tasks via Rube MCP (Composio): tickets, users, organizations, replies. Always search tools first for current schemas.

youtube-knowledge-extractor

from aAAaqwq/AGI-Super-Team

This skill performs deep analysis of YouTube videos through **both information channels** Multimodal YouTube video analysis through both audio (transcript) and visual (frame extraction + image analysis) channels. Especially powerful for HowTo videos, tutorials, demos, and explainer videos where what is SHOWN (screenshots, UI demos, diagrams, code, physical actions) is just as important as what is SAID. Use this skill whenever a user wants to analyze, summarize, or create step-by-step guides from YouTube videos, or when they share a YouTube URL and want to understand what happens in the video. Triggers on requests like "Analyze this YouTube video", "Create a step-by-step guide from this video", "What does this video show?", "Summarize this tutorial", or any YouTube URL shared with analysis intent.

youtube-factory

from aAAaqwq/AGI-Super-Team

Generate complete YouTube videos from a single prompt - script, voiceover, stock footage, captions, thumbnail. Self-contained, no external modules. 100% free tools.

youtube-automation

from aAAaqwq/AGI-Super-Team

Automate YouTube tasks via Rube MCP (Composio): upload videos, manage playlists, search content, get analytics, and handle comments. Always search tools first for current schemas.

xlsx

from aAAaqwq/AGI-Super-Team

Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2) Reading or analyzing data, (3) Modify existing spreadsheets while preserving formulas, (4) Data analysis and visualization in spreadsheets, or (5) Recalculating formulas

xiaomo-assistant-template

from aAAaqwq/AGI-Super-Team

小a助手配置模板。基于 xiaomo-starter-kit 改编，提供预配置的 OpenClaw 助手框架文件。当用户需要快速配置新助手、设置助手身份、创建助手配置文件时使用此技能。