auditor-workflow
Group-level implementation audit workflow for auditor agents. Handles loading project rules, reading connected phases, reviewing code reviews, checking deferred items, cross-phase impact analysis, verification, and structured reporting to the orchestrator. Invoke this skill as your first action — not user-invocable.
Best use case
auditor-workflow is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Group-level implementation audit workflow for auditor agents. Handles loading project rules, reading connected phases, reviewing code reviews, checking deferred items, cross-phase impact analysis, verification, and structured reporting to the orchestrator. Invoke this skill as your first action — not user-invocable.
Teams using auditor-workflow should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/auditor-workflow/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How auditor-workflow Compares
| Feature / Agent | auditor-workflow | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Group-level implementation audit workflow for auditor agents. Handles loading project rules, reading connected phases, reviewing code reviews, checking deferred items, cross-phase impact analysis, verification, and structured reporting to the orchestrator. Invoke this skill as your first action — not user-invocable.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
Best AI Skills for Claude
Explore the best AI skills for Claude and Claude Code across coding, research, workflow automation, documentation, and agent operations.
Cursor vs Codex for AI Workflows
Compare Cursor and Codex for AI coding workflows, repository assistance, debugging, refactoring, and reusable developer skills.
SKILL.md Source
# Auditor Group Workflow
You have been assigned a **group of connected phases** to audit. Your spawn prompt contains: the plan folder path, the group name, the list of phase files in this group, and a summary of previous groups' deviations (if any).
You are **read-only** — you observe, analyse, and report. You never modify source code, phase files, or review files. Your only output is the audit report file and a SendMessage to the orchestrator.
## Why This Workflow Exists
Per-phase code reviews check each brick. This audit checks whether the wall is straight. It catches problems that structurally cannot be found in per-phase reviews:
| Problem | Why Per-Phase Review Misses It |
|---------|-------------------------------|
| Missing coding conventions | Teammates don't inherit parent rules — auditor flags violations it can't see |
| Cross-phase regressions | Each reviewer only sees their own phase |
| Deferred items never actioned | No subsequent step checks if they were fixed |
| Plan drift | Individual phases can pass review while collectively deviating from intent |
| Previous group deviations compounding | Each group auditor only sees its own group without explicit cross-group context |
## Step 0: Load Project Rules
Teammates don't inherit all parent rules — only `alwaysApply` rules load natively. File-scoped rules (coding conventions, patterns) must be read explicitly.
Read these two files (they contain universal conventions for all code):
```
Read ~/.claude/rules/coding-style.md
Read ~/.claude/rules/patterns.md
```
If either file doesn't exist, skip it — the project may not have those rules configured.
These rules inform what you flag during audit — import ordering, error handling, service patterns, data fetching conventions. Without them, you can't catch convention violations.
## Step 1: Read Group Context
Read each phase file in your assigned group. Extract:
| Field | Source |
|-------|--------|
| Phase number + title | Frontmatter + heading |
| Status | Frontmatter `status:` |
| Skill used | Frontmatter `skill:` |
| Key files targeted | Implementation steps (file paths) |
| Acceptance criteria | "Verifiable Acceptance Criteria" section |
| Dependencies | Frontmatter `dependencies:` |
Also read `{plan-folder}/plan.md` to extract:
- **Architectural North Star** — the patterns every phase was supposed to follow
- **Security Requirements** — what security properties were mandated
- **Decision Log** — ADRs that constrain the implementation
Build a complete inventory of what this group was supposed to deliver.
## Step 2: Collect and Review Code Reviews
Find code review files for the group's phases: `{plan-folder}/reviews/code/phase-{NN}.md`
For each review, extract:
| Field | What to Look For |
|-------|--------------------|
| Verdict | PASS or FAIL |
| Auto-fixed items | "Fixes Applied" section |
| Deferred items | "Deferred to main agent" markers |
| Next Steps items | "Next Steps (Main Agent)" section |
| Critical/High counts | Issue severity distribution |
**Build a cross-review summary:**
- Total issues found across group phases (by severity)
- Total auto-fixed vs deferred vs unresolved
- Recurring patterns (same issue type appearing in 2+ phases within the group)
## Step 3: Check Deferred Items Against Current Code
This is the highest-value step. Code reviewers defer items to the "main agent" but those items may never get actioned.
For each deferred item and "Next Steps" recommendation:
1. **Read the source file** at the file:line referenced in the review
2. **Check if the issue was fixed** — compare current code against the reported issue
3. **Classify:**
| Status | Meaning |
|--------|---------|
| **Resolved** | The issue was fixed |
| **Still Present** | The exact issue still exists at that location |
| **Partially Addressed** | Some aspect was fixed but the core concern remains |
| **File Changed** | The file was modified but the specific concern can't be verified |
| **N/A** | The file no longer exists or the code path was removed |
## Step 4: Cross-Phase Impact Analysis
Check whether changes in later phases broke earlier phases' work within this group.
### 4a: Identify Shared Files
Collect all file paths from implementation steps across group phases. Find files that appear in 2+ phases. For each, note which phases touch it and what each does.
### 4b: Check for Overwrites and Conflicts
For each shared file:
1. **Read the current file** — this is the final state
2. **Check git history:**
```bash
git log --oneline --follow -- {file-path}
```
3. **Verify the file still satisfies earlier phases' requirements:**
- Does it still have the interfaces/functions earlier phases added?
- Were earlier changes preserved or overwritten?
- Did a later phase's refactoring break the contract?
### 4c: Check Import and Dependency Chains
For key files created in early phases and consumed by later phases:
1. Grep for imports of the file across the codebase
2. Verify exports still match what consumers expect
3. Check for broken references — imports of things that no longer exist
## Step 5: Run Verification
Run the actual verification suite. Don't trust previous results.
```bash
pnpm test 2>&1 | tail -50
```
```bash
pnpm run typecheck 2>&1 | tail -50
```
Record:
- Total tests: pass / fail / skip
- Typecheck: clean or number of errors
- Correlate any failures to specific phases in this group
**Note:** Failures may come from phases outside this group. Only flag failures traceable to this group's phases.
## Step 6: Plan vs Implementation Comparison
### 6a: Acceptance Criteria Verification
For each phase in the group, verify each acceptance criterion:
| Method | When to Use |
|--------|-------------|
| **Grep/Glob** | File existence, export names, table names |
| **Read** | Function signatures, component props, schema shapes |
| **Test results** | Functional behaviour (from Step 5) |
| **Code review verdict** | Quality and pattern compliance (from Step 2) |
Classify each criterion:
| Status | Meaning |
|--------|---------|
| **Met** | Evidence confirms it's satisfied |
| **Partially Met** | Some aspects delivered, others missing |
| **Not Met** | No evidence it was satisfied |
| **Superseded** | Intentionally changed (check Decision Log) |
| **Unverifiable** | Cannot confirm without manual/runtime testing |
### 6b: Scope Comparison
- **Missing deliverables** — acceptance criteria not fully met
- **Scope creep** — significant code not traceable to any phase requirement
- **Architectural drift** — deviations from the Architectural North Star
- **Security gaps** — Security Requirements not verified by reviews or tests
### 6c: Decision Log Compliance
Verify implementation respects each active ADR. Flag violations of accepted decisions and use of deprecated/superseded decisions.
## Step 7: Consider Previous Group Deviations
Your spawn prompt includes a summary of deviations found in previous groups (if any). Check whether this group's implementation:
1. **Compounds previous deviations** — builds on top of earlier drift, making it worse
2. **Contradicts previous fixes** — undoes corrections that earlier auditor findings triggered
3. **Introduces new drift in the same area** — same architectural pattern being violated again
If previous deviations are relevant, flag them with explicit cross-group references.
## Step 8: Write Audit Report
Write the report to: `{plan-folder}/reviews/implementation/group-{name}-audit.md`
Create the `reviews/implementation/` directory if it doesn't exist.
**If the report already exists** (re-audit), Read the existing file first, then overwrite.
### Report Structure
```markdown
# Group Audit: {group-name}
**Phases:** {list of phase numbers and titles}
**Audited:** {date}
**Code Reviews:** {count} ({pass} PASS, {fail} FAIL)
**Verification:** Tests {pass/fail} | Typecheck {pass/fail}
**Assessment:** {Clean | Minor Issues | Significant Gaps | Major Concerns}
---
## 1. Code Review Summary
| Metric | Count |
|--------|-------|
| Total issues found | {N} |
| Auto-fixed by reviewers | {N} |
| Deferred to main agent | {N} |
| Still unresolved | {N} |
### Recurring Patterns
| # | Pattern | Phases Affected | Severity |
|---|---------|----------------|----------|
| 1 | ... | P01, P03 | ... |
---
## 2. Unresolved Deferred Items
| # | Phase | File:Line | Original Issue | Current Status | Severity | Suggested Fix |
|---|-------|-----------|---------------|----------------|----------|---------------|
| 1 | P03 | path:42 | [from review] | Still Present | Medium | [specific fix] |
**Summary:** {X} of {Y} deferred items remain unresolved.
---
## 3. Cross-Phase Impact
### Shared Files
| File | Phases | Current State |
|------|--------|---------------|
| path/file.ts | P01, P04 | {intact / conflicts / needs review} |
### Regressions Found
| # | Earlier Phase | Later Phase | File | What Broke | Severity |
|---|--------------|-------------|------|------------|----------|
| 1 | P02 | P05 | ... | ... | ... |
---
## 4. Verification Results
**Tests:** {PASS/FAIL} — {X passed, Y failed, Z skipped}
**Typecheck:** {PASS/FAIL} — {N errors or clean}
{If failures traceable to this group:}
| # | File | Failure | Likely Phase | Severity |
|---|------|---------|-------------|----------|
| 1 | ... | ... | P{NN} | ... |
---
## 5. Acceptance Criteria
| Phase | Total | Met | Partial | Not Met | Superseded |
|-------|-------|-----|---------|---------|------------|
| P01 | {N} | {N} | {N} | {N} | {N} |
| **Total** | **{N}** | **{N}** | **{N}** | **{N}** | **{N}** |
### Scope Issues
**Missing Deliverables:**
{Criteria marked "Not Met"}
**Architectural Drift:**
{Deviations from Architectural North Star}
---
## 6. Cross-Group Deviations
{How this group relates to previous groups' findings — compounding, contradicting, or new drift}
---
## 7. Findings by Severity
### Critical
{Issues that block shipping — security gaps, data loss risks, broken core functionality}
| # | Finding | Phase | File:Line | Suggested Fix |
|---|---------|-------|-----------|---------------|
### High
{Issues that cause significant problems — test failures, type errors, missing key deliverables}
| # | Finding | Phase | File:Line | Suggested Fix |
|---|---------|-------|-----------|---------------|
### Medium
{Quality debt — unresolved deferred items, partial acceptance criteria, minor regressions}
| # | Finding | Phase | File:Line | Suggested Fix |
|---|---------|-------|-----------|---------------|
### Low
{Nice-to-haves — style improvements, minor optimisations, documentation gaps}
| # | Finding | Phase | File:Line | Suggested Fix |
|---|---------|-------|-----------|---------------|
---
## 8. Summary
**Assessment:** {Clean | Minor Issues | Significant Gaps | Major Concerns}
**Rationale:** {2-3 sentences}
**Critical/High count:** {N} Critical, {N} High
**Recommendation:** {No action needed | Auto-fixable | Needs user input}
```
## Step 9: Report to Orchestrator
Send a structured message to the orchestrator with the key findings:
```
SendMessage({
type: "message",
recipient: "team-lead",
content: "Group '{group-name}' audit complete.\n\n**Assessment:** {verdict}\n**Report:** {report-path}\n\n**Critical:** {count} | **High:** {count} | **Medium:** {count} | **Low:** {count}\n\n**Top findings:**\n1. [most important finding with severity]\n2. [second finding]\n3. [third finding]\n\n**Unresolved deferred items:** {X} of {Y}\n**Acceptance criteria:** {met}/{total} met\n**Tests:** {pass/fail} | **Typecheck:** {pass/fail}\n\n**Recommendation:** {No action needed | Auto-fixable | Needs user input}",
summary: "Group {name} audit: {verdict}"
})
```
Then go idle. The orchestrator will read the full report and decide next steps.
## Task Tracking
Before starting work, run `TaskList` to check if tasks already exist from a previous session or context compact. If tasks exist, resume from the first `pending` or `in_progress` task.
If no tasks exist, create them after reading the group context (Step 1):
```
Task 1: Read group phases and build inventory
Task 2: Collect and review code reviews
Task 3: Check deferred items against current code
Task 4: Cross-phase impact analysis
Task 5: Run verification (tests + typecheck)
Task 6: Plan vs implementation comparison
Task 7: Consider previous group deviations
Task 8: Write audit report
Task 9: Report to orchestrator
```
Mark each task `in_progress` when starting and `completed` when done.
## Resuming After Context Compact
1. `TaskList` → find `in_progress` or first `pending` task
2. `TaskGet` on that task → read details
3. Continue from that task — don't restart
4. The task list is your source of truth, not your memory
## Constraints
- **Read-only** — do NOT modify source code, phase files, or review files
- **Your only writes** are the audit report and SendMessage to orchestrator
- **All phases in the group** — review every phase's code review, not just a sample
- **Real verification** — run actual `pnpm test` and `pnpm run typecheck`
- **Codebase grounding** — verify deferred items against actual code, not just review text
- **Severity calibration:**
- Critical = blocks shipping (security gaps, data loss, broken core)
- High = causes significant problems (test failures, type errors, missing key deliverables)
- Medium = quality debt (deferred items, partial criteria, minor regressions)
- Low = nice-to-haves (style, optimisation, docs)
- **No blame** — focus on what needs fixing, not who caused it
- **Intentional changes are fine** — if the Decision Log explains a deviation, it's not a gapRelated Skills
auditor-gate
Apply final governance and release-gate checks to a judged change set by reading `handoff.json`, `verdict.json`, optional eval evidence, and emitting machine-readable `audit.json` with `gate` status. Use when implementation already has a judge verdict and a separate auditor must decide landability (`pass`, `fail`, or `needs-human`) without modifying source files.
architecture-auditor
Architecture audit and analysis specialist for Modular Monoliths. **ALWAYS use when reviewing codebase architecture, evaluating bounded contexts, assessing shared kernel size, detecting "Core Obesity Syndrome", or comparing implementation against ADR-0001 and anti-patterns guide.** Use proactively when user asks about context isolation, cross-context coupling, or shared kernel growth. Examples - "audit contexts structure", "check shared kernel size", "find cross-context imports", "detect base classes", "review bounded context isolation", "check for Core Obesity".
AI Safety Auditor
Audit AI systems for safety, bias, and responsible deployment
ai-doc-system-auditor
No description provided.
agent-security-auditor
Expert security auditor specializing in comprehensive security assessments, compliance validation, and risk management. Masters security frameworks, audit methodologies, and compliance standards with focus on identifying vulnerabilities and ensuring regulatory adherence.
agent-compliance-auditor
Validates agent definitions against the Antigravity audit rubric.
Accessibility Auditor
Web accessibility specialist for WCAG compliance, ARIA implementation, and inclusive design. Use when auditing websites for accessibility issues, implementing WCAG 2.1 AA/AAA standards, testing with screen readers, or ensuring ADA compliance. Expert in semantic HTML, keyboard navigation, and assistive technology compatibility.
workflows-work
Execute work plans efficiently while maintaining quality and finishing features
sc-workflow
Generate structured implementation workflows from PRDs and feature requirements with expert guidance. Use when planning feature implementation, breaking down PRDs, or creating development roadmaps.
r2r-workflow
Explain the Research-to-Roadmap (r2r) workflow visually. Shows the 6-phase pipeline JD built for transforming R&D projects into production roadmaps.
feature-dev-workflow
Complete end-to-end feature development workflow from issue tracking through PR delivery. Use for implementing features, building new functionality, and adding capabilities. Includes requirements discovery, architecture planning, implementation, testing, code review, design audit, and comprehensive validation.
android-workflow-production
Generate GitHub Actions workflows for production deployment with staged rollout