auto-test-execution
Automatically execute tests when code-generating agents modify source files, enforcing the execute-before-return pattern
Best use case
auto-test-execution is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
It is a strong fit for teams already working in Codex.
Automatically execute tests when code-generating agents modify source files, enforcing the execute-before-return pattern
Teams using auto-test-execution should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/auto-test-execution/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How auto-test-execution Compares
| Feature / Agent | auto-test-execution | Standard Approach |
|---|---|---|
| Platform Support | Codex | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Automatically execute tests when code-generating agents modify source files, enforcing the execute-before-return pattern
Which AI agents support this skill?
This skill is designed for Codex.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
AI Agents for Coding
Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.
Cursor vs Codex for AI Workflows
Compare Cursor and Codex for AI coding workflows, repository assistance, debugging, refactoring, and reusable developer skills.
AI Agents for Marketing
Discover AI agents for marketing workflows, from SEO and content production to campaign research, outreach, and analytics.
SKILL.md Source
# auto-test-execution
Automatically execute tests when code-generating agents write to source files, enforcing the execute-before-return pattern.
## Triggers
Primary phrases matched automatically from skill description. No additional alternate expressions defined.
## Purpose
This skill enforces the MetaGPT executable feedback pattern: code-generating agents must execute tests before returning results to the user. It activates automatically when agents modify source code files.
## Behavior
When triggered, this skill:
1. **Detect modified files**:
- Track which source files the agent has written to
- Identify the relevant test framework
2. **Find related tests**:
- Look for test files matching the modified source
- Convention: `src/foo/bar.ts` -> `test/unit/foo/bar.test.ts`
- If no tests exist, prompt agent to generate them
3. **Execute tests**:
- Run the project's test command focused on relevant tests
- Capture results: passed, failed, errors
4. **Handle results**:
- All pass: Allow agent to return results
- Failures: Trigger debug-and-retry loop (max 3 attempts)
- Persistent failures: Escalate with debug memory context
5. **Update debug memory**:
- Record session in `.aiwg/ralph/debug-memory/sessions/`
- Extract patterns for future reference
## Activation Conditions
```yaml
activation:
always_active_for:
- software-implementer
- debugger
- test-engineer
triggered_by:
- file_write:
patterns:
- "src/**/*.ts"
- "src/**/*.js"
- "src/**/*.py"
- "**/*.go"
- "**/*.rs"
skip_when:
- test_files_only: true
- documentation_only: true
- configuration_only: true
```
## Integration
This skill uses:
- `project-awareness`: Detect test framework and configuration
- Debug memory at `.aiwg/ralph/debug-memory/` for pattern learning
## References
- @$AIWG_ROOT/agentic/code/frameworks/sdlc-complete/rules/executable-feedback.md - Feedback rules
- @$AIWG_ROOT/agentic/code/addons/ralph/docs/executable-feedback-guide.md - Guide
- @$AIWG_ROOT/agentic/code/addons/ralph/schemas/debug-memory.yaml - Memory schema
- @.aiwg/research/findings/REF-013-metagpt.md - Research foundationRelated Skills
pytest-runner
Execute Python tests with pytest, supporting fixtures, markers, coverage, and parallel execution. Use for Python test automation.
vitest-runner
Execute JavaScript/TypeScript tests with Vitest, supporting coverage, watch mode, and parallel execution. Use for JS/TS test automation.
test-sync
Detect orphaned tests, obsolete assertions, and test-code misalignment. Use for test suite maintenance, cleanup, and traceability validation.
test-coverage
Analyze test coverage reports, identify gaps by priority, and recommend test improvements with trend tracking
mutation-test
Run mutation testing to validate test quality beyond code coverage. Use when assessing test effectiveness, finding weak tests, or validating test suite quality.
issue-auto-sync
Detect issue references in commits and artifacts and automatically update or close linked tracker issues
generate-tests
Generate comprehensive test suite for $ARGUMENTS following project testing conventions and best practices.
flow-test-strategy-execution
Orchestrate comprehensive test strategy with test suite execution, coverage validation, defect triage, and regression analysis
execution-mode
Set or report the reproducibility mode governing how AIWG workflows execute, from standard to fully auditable
devkit-test
Auto-fix discoverable issues
auto-provenance
Generate W3C PROV-compliant provenance records automatically when agents create or modify artifacts
aiwg-orchestrate
Route structured artifact work to AIWG workflows via MCP with zero parent context cost