tdd-enforcement
Red-Green-Refactor TDD methodology with mandatory failing tests, minimal implementation, quality refactoring, and 80% coverage gating.
Best use case
tdd-enforcement is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Red-Green-Refactor TDD methodology with mandatory failing tests, minimal implementation, quality refactoring, and 80% coverage gating.
Teams using tdd-enforcement should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/tdd-enforcement/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How tdd-enforcement Compares
| Feature / Agent | tdd-enforcement | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Red-Green-Refactor TDD methodology with mandatory failing tests, minimal implementation, quality refactoring, and 80% coverage gating.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
SKILL.md Source
# TDD Enforcement ## Overview Strict test-driven development enforcement adapted from the Everything Claude Code methodology. Mandates the Red-Green-Refactor cycle with evidence-based verification at each phase. ## TDD Process ### 1. RED Phase - Write Failing Tests - Write tests that define expected behavior - Tests MUST fail (exit code 1) - Use CI=true or --run flag, never watch mode - Apply timeout guards (60s) to prevent hanging - Record exit code as evidence ### 2. GREEN Phase - Minimal Implementation - Write the minimal code to make tests pass - Do NOT add features not covered by tests - Do NOT optimize prematurely - Tests MUST pass (exit code 0) - Record exit code as evidence ### 3. REFACTOR Phase - Quality Improvement - Apply SOLID principles and clean code patterns - Improve naming, reduce coupling - Remove duplication - Run tests after EACH refactoring step - Tests MUST remain passing (exit code 0) ### 4. Coverage Gate - Measure coverage: statements, branches, functions, lines - Minimum 80% overall coverage required - Iterate: write additional tests for gaps until threshold met - Maximum 3 convergence iterations ## Rules - Never skip the RED phase - Never accept GREEN without exit code 0 - Never use watch mode in CI - Always record evidence (exit codes, coverage numbers) - Enforce 80% coverage threshold ## When to Use - All code implementation tasks - Feature development - Bug fixes (write regression test first) ## Agents Used - `tdd-guide` (primary consumer) - `code-reviewer` (validates test quality)
Related Skills
test-enforcement
Automated test validation, coverage checking, and quality metrics with aggressive defaults
process-builder
Scaffold new babysitter process definitions following SDK patterns, proper structure, and best practices. Guides the 3-phase workflow from research to implementation.
babysitter
Orchestrate via @babysitter. Use this skill when asked to babysit a run, orchestrate a process or whenever it is called explicitly. (babysit, babysitter, orchestrate, orchestrate a run, workflow, etc.)
yolo
Run Babysitter autonomously with minimal manual interruption.
user-install
Install the user-level Babysitter Codex setup.
team-install
Install the team-pinned Babysitter Codex workspace setup.
retrospect
Summarize or retrospect on a completed Babysitter run.
resume
Resume an existing Babysitter run from Codex.
project-install
Install the Babysitter Codex workspace integration into the current project.
plan
Plan a Babysitter workflow without executing the run.
observe
Observe, inspect, or monitor a Babysitter run.
model
Inspect or change Babysitter model-routing policy by phase.