superpowers-tdd

Use when implementing any feature or bugfix, before writing implementation code - enforces RED-GREEN-REFACTOR cycle: write failing test first, verify it fails, write minimal code, verify it passes, then refactor

3,891 stars

byopenclaw

View on GitHub Installation ↓

Best use case

superpowers-tdd is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using superpowers-tdd should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/superpowers-tdd/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/axelhu/superpowers-tdd/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/superpowers-tdd/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How superpowers-tdd Compares

Feature / Agent	superpowers-tdd	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

AI Agents for Coding

Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.

Best AI Skills for Claude

Explore the best AI skills for Claude and Claude Code across coding, research, workflow automation, documentation, and agent operations.

Cursor vs Codex for AI Workflows

Compare Cursor and Codex for AI coding workflows, repository assistance, debugging, refactoring, and reusable developer skills.

SKILL.md Source

# Superpowers TDD - 测试驱动开发

## 核心准则

**先写测试，看着它失败，写最少的代码让它通过。**

如果没看过测试失败，就不知道测试的是不是对的东西。

**违反规则的字面意思 = 违反规则的精神。**

## 铁律

```
没有失败的测试，就不能写生产代码

先写代码后写测试？删除它。从头开始。
```

**没有例外：**
- 不要留着当"参考"
- 不要在写测试时"改编"它
- 不要看它
- 删除就是删除

## 红-绿-重构循环

### RED - 写失败的测试

写一个最小测试，展示应该发生什么。

```python
# Good: 清晰名称，测试真实行为，一件事
def test_retries_failed_operations_3_times():
    attempts = 0
    def operation():
        nonlocal attempts
        attempts += 1
        if attempts < 3:
            raise Exception('fail')
        return 'success'
    
    result = retry_operation(operation)
    assert result == 'success'
    assert attempts == 3
```

```
❌ Bad: 模糊名称，测试 mock 而不是真实代码
def test_retry_works():
    mock = MagicMock()
    mock.side_effect = [Exception(), Exception(), 'success']
```

**要求：**
- 一个行为
- 清晰名称
- 真实代码（除非不可避免才用 mock）

### 验证 RED - 看着它失败

**必须执行，从不跳过。**

```bash
pytest tests/path/test_name.py -v
# 或项目对应的测试命令
```

确认：
- 测试失败（不是错误）
- 失败信息符合预期
- 失败是因为功能缺失（不是拼写错误）

**测试通过了？** 你在测试已有行为。修复测试。

### GREEN - 最少代码

写最简单的代码让测试通过。

```python
# Good: 刚好够通过测试
def retry_operation(fn, max_retries=3):
    for i in range(max_retries):
        try:
            return fn()
        except Exception as e:
            if i == max_retries - 1:
                raise e
```

```
❌ Bad: 过度设计，加入了测试没要求的功能
def retry_operation(fn, options={max_retries=3, backoff='linear', onRetry=...})
    # YAGNI - 你不需要这个
```

不要添加功能、不要重构其他代码、不要"改进"超过测试要求。

### 验证 GREEN - 看着它通过

**必须执行。**

```bash
pytest tests/path/test_name.py -v
```

确认：
- 测试通过
- 其他测试仍然通过
- 输出干净（无错误、无警告）

**测试失败？** 修代码，不是测试。

**其他测试失败？** 立刻修。

### REFACTOR - 重构

只有在 green 之后：
- 移除重复
- 改进名称
- 提取辅助函数

保持测试 green。不添加行为。

### 重复

下一个失败测试对应下一个功能。

## 好测试的特征

| 质量 | 好 | 坏 |
|------|----|----|
| **最小** | 一件事。名称里有"and"？拆开它。 | `test_validates_email_and_domain_and_whitespace` |
| **清晰** | 名称描述行为 | `test_test1` |
| **展示意图** | 展示期望的 API | 掩盖代码应该做什么 |

## 常见借口（别信）

| 借口 | 现实 |
|------|------|
| "太简单了不用测" | 简单代码也会坏。测试只用30秒。 |
| "我之后测" | 测试立即通过什么也证明不了。 |
| "之后测试也能达到同样目标" | 之后测试 = "这代码干了什么？" 先写测试 = "这代码应该干什么？" |
| "我已经手动测试了" | 手动测试是随意的。没有记录，不能重跑。 |
| "删除 X 小时的工作太浪费" | 沉没成本谬误。留着不可信的代码是技术债。 |
| "留着当参考" | 你会改编它。那就是之后测试。删除就是删除。 |
| "TDD 太教条" | TDD 是务实的：比事后调试快。 |

## 红旗 - 停止并从头开始

- 先写代码后写测试
- 实现之后才写测试
- 测试立即通过
- 说不清为什么测试失败
- "之后"添加的测试
- 合理化"就这一次"
- "我已经手动测试了"
- "之后测试能达到同样目的"
- "这是精神不是仪式"
- "留着当参考"或"改编现有代码"

**所有这些意味着：删除代码。从头用 TDD。**

## Bug 修复示例

**Bug:** 空邮箱被接受

**RED**
```python
def test_rejects_empty_email():
    result = submit_form({'email': ''})
    assert result['error'] == 'Email required'
```

**Verify RED**
```
$ pytest
FAIL: expected 'Email required', got undefined
```

**GREEN**
```python
def submit_form(data):
    if not data.get('email', '').strip():
        return {'error': 'Email required'}
    # ...
```

**Verify GREEN**
```
$ pytest
PASS
```

**REFACTOR**
如需要可提取多字段验证。

## 完成前检查表

- [ ] 每个新函数/方法都有测试
- [ ] 实现前看过每个测试失败
- [ ] 每个测试失败原因符合预期（功能缺失，不是 typo）
- [ ] 写了最少的代码让每个测试通过
- [ ] 所有测试通过
- [ ] 输出干净（无错误、无警告）
- [ ] 测试使用真实代码（除非不可避免才用 mock）
- [ ] 覆盖了边界情况和错误

不能全部打勾？跳过了 TDD。从头开始。

Related Skills

superpowers-executing-plans

3891

from openclaw/skills

Use when executing a written implementation plan in the current session with sequential task execution and review checkpoints - for when subagent-driven mode is not available

superpowers-writing-plans

3891

from openclaw/skills

Use when you have a spec or requirements for a multi-step task, before touching code - guides writing comprehensive implementation plans with bite-sized tasks, TDD, and DRY/YAGNI principles

superpowers-verification

3891

from openclaw/skills

Use when about to claim any work is complete, fixed, passing, or successful - requires running fresh verification commands and reading actual output before making any success claims; evidence before assertions always

superpowers-systematic-debugging

3891

from openclaw/skills

Use when encountering any bug, test failure, or unexpected behavior - enforces systematic four-phase debugging: root cause investigation, pattern analysis, hypothesis testing, and evidence-based fix verification

superpowers-subagent-dev

3891

from openclaw/skills

Use when executing implementation plans with independent tasks - coordinates task execution by dispatching subagents per task with verification checkpoints, adapted for OpenClaw's isolated session model

superpowers-parallel-agents

3891

from openclaw/skills

Use when facing 2 or more independent tasks that can be worked on without shared state - dispatches parallel subagents using sessions_spawn for concurrent investigation and execution, adapted for OpenClaw

superpowers-overview

3891

from openclaw/skills

Use when starting any development work or when unsure which superpowers development skill to use - provides entry point and navigation to the full superpowers skill suite for OpenClaw agents

superpowers-isolated-workspace

3891

from openclaw/skills

Use when starting feature work that needs isolation from current workspace - creates isolated git branches with clean setup and safety verification, adapted for OpenClaw environments

superpowers-finishing-branch

3891

from openclaw/skills

Use when implementation is complete, all tests pass, and you need to decide how to integrate the work - presents structured options for merge, PR, or cleanup; adapted for OpenClaw git workflow without worktrees

superpowers-brainstorming

3891

from openclaw/skills

Use before any creative work - creating features, building components, adding functionality, or modifying behavior - guides through exploration, questioning, design proposal, and spec documentation before any implementation

superpowers-requesting-code-review

3891

from openclaw/skills

Use when completing tasks, implementing major features, or before merging - dispatches code review subagent to catch issues before they cascade, adapted for OpenClaw sessions_spawn model

superpowers-receiving-code-review

3891

from openclaw/skills

Use when receiving code review feedback - requires technical verification before implementing suggestions, with reasoned pushback when feedback is technically questionable; no performative agreement