workflow

Guide through structured delivery workflow with plan, implement, validate phases

8 stars

bystevengonsalvez

View on GitHub Installation ↓

Best use case

workflow is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Guide through structured delivery workflow with plan, implement, validate phases

Teams using workflow should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/workflow/SKILL.md --create-dirs "https://raw.githubusercontent.com/stevengonsalvez/agents-in-a-box/main/toolkit/packages/skills/workflow/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/workflow/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How workflow Compares

Feature / Agent	workflow	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Guide through structured delivery workflow with plan, implement, validate phases

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Workflow

You are tasked with orchestrating the complete research->plan->implement workflow for a given task or requirement. This command manages the entire pipeline from initial research through implementation.

## Initial Setup

When this command is invoked, respond with:
```
I'll orchestrate the complete workflow for your task.

This will include:
1. **Research**: Comprehensive investigation of the problem space
2. **Planning**: Creating a detailed implementation plan
3. **Implementation**: Executing the plan with verification

Please provide:
- The task/feature/bug you want to address
- Any specific constraints or requirements
- Whether you want to run the full pipeline or start from a specific phase

You can also say "workflow continue" if you have existing research or plans.
```

Then wait for the user's input.

### Phase 0: Project State Check

Before starting any workflow phase, check for persistent project state:

1. **Check for `.planning/` directory**:
- If exists: Read `.planning/STATE.md` to understand current position
- Load `.planning/ROADMAP.md` for phase context
- Resume from where STATE.md indicates

2. **If `.planning/` does NOT exist**:
- Ask: "Would you like to set up persistent project state? This tracks requirements, roadmap, and decisions across sessions."
- If yes: Initialize from templates
```bash
mkdir -p .planning/phases .planning/todos
```
- Copy templates from `{{HOME_TOOL_DIR}}/templates/planning/` to `.planning/`
- Guide user through filling in PROJECT.md (vision + constraints)
- If no: Continue with standard workflow (backward compatible)

3. **If `.planning/` exists and has ROADMAP.md**:
- Identify current phase from STATE.md
- Check if current phase has CONTEXT.md (decisions captured)
- If no CONTEXT.md: Suggest running `/discuss` before planning
- Load phase context for downstream commands

## Workflow Phases

### Phase 1: Research

If no existing research found in `research/` directory:

1. **Invoke research phase**:
- Call the `/research` command internally
- Conduct comprehensive multi-modal research
- Save findings to `research/YYYY-MM-DD_HH-MM-SS_topic.md`

2. **Review research output**:
- Read the generated research document
- Identify key findings and constraints
- Note any open questions that need clarification

3. **Present research summary**:
```
## Research Complete

Key findings:
- [Major discovery 1]
- [Major discovery 2]
- [Important constraint]

The research has been saved to: research/[filename].md

Shall I proceed to create an implementation plan based on these findings?
```

### Phase 2: Planning

After research is complete or if existing research is found:

1. **Invoke planning phase**:
- Call the `/plan` command with research context
- Reference the research document findings
- Create detailed, phased implementation plan

2. **Interactive planning**:
- Work with user to refine the plan
- Ensure all phases have clear success criteria
- Save to `plans/descriptive_name.md`

3. **Present plan summary**:
```
## Plan Complete

Implementation will be done in [N] phases:
1. [Phase 1 name] - [objective]
2. [Phase 2 name] - [objective]
...

The plan has been saved to: plans/[filename].md

Ready to begin implementation?
```

### Phase 3: Implementation

After plan is approved:

1. **Invoke implementation**:
- Call the `/implement` command with plan path
- Execute phase by phase
- Track progress with TodoWrite

2. **Progress tracking**:
- Update checkmarks in plan as phases complete
- Run verification after each phase
- Handle any issues that arise

3. **Present completion status**:
```
## Implementation Status

Phase 1: [Name] - Complete
Phase 2: [Name] - Complete
Phase 3: [Name] - Partial (see notes)

Verification Results:
- Tests: [status]
- Linting: [status]
- Build: [status]

Would you like me to run validation on the implementation?
```

### Phase 4: Validation (Optional)

If requested or if issues found:

1. **Run validation**:
- Execute all automated checks
- Verify against plan success criteria
- Generate validation report

2. **Present validation results**:
```
## Validation Report

Automated Checks:
All tests passing
Linting clean
Build successful

Plan Criteria:
[Criterion 1] - Verified
[Criterion 2] - Verified
[Criterion 3] - Needs manual verification

The implementation is ready for review.
```

### Phase 5: Knowledge Capture (Suggested)

After successful implementation, offer to capture learnings:

```
## Capture Learnings?

This task is complete. If you encountered any problems worth documenting
for future reference, consider running:

/compound

This will:
- Capture the solution as a searchable learning
- Save it to docs/solutions/ for future sessions
- Make it discoverable via /research

Useful when:
- You debugged a tricky issue
- You found a non-obvious solution
- You want to remember "the fix" for next time

Skip this if the work was straightforward with no notable discoveries.
```

### .planning/ State Updates

After each major phase transition, update `.planning/STATE.md`:
- After Research: Update "Last activity", add key findings to "Recent Decisions"
- After Plan: Update phase status, link to plan file
- After Implement: Update progress, mark completed requirements
- After Validate: Record validation status, update blockers if any

## Workflow Commands

The user can control the workflow with these commands:

- **`workflow start`** - Begin from research phase
- **`workflow continue`** - Resume from existing research/plan
- **`workflow skip-research`** - Start directly with planning
- **`workflow validate`** - Run validation on completed work
- **`workflow status`** - Show current progress

## Progress Tracking

Throughout the workflow, maintain a master todo list:

```
Master Workflow: [Task Name]
[ ] Research Phase
[ ] Codebase analysis
[ ] Documentation review
[ ] External research (if needed)
[ ] Synthesize findings
[ ] Planning Phase
[ ] Review research
[ ] Draft plan structure
[ ] Detail each phase
[ ] Define success criteria
[ ] Implementation Phase
[ ] Phase 1: [Name]
[ ] Phase 2: [Name]
[ ] Phase 3: [Name]
[ ] Validation Phase
[ ] Run automated tests
[ ] Verify success criteria
[ ] Generate report
```

Update this list as you progress through each phase.

## Handling Interruptions

If the workflow is interrupted:

1. **Save state**:
- Document what phase you're in
- Note what's been completed
- Save any partial work

2. **On resume**:
- Check for existing research in `research/`
- Check for existing plans in `plans/`
- Look for checkmarks in plans indicating progress
- Continue from where you left off

## Best Practices

1. **Always start with research** unless explicitly told to skip
2. **Plans must reference research findings** to ensure alignment
3. **Implementation must follow the plan** (adapt if needed, but document deviations)
4. **Validation is not optional** for production-ready code
5. **Keep the user informed** at each phase transition
6. **Use TodoWrite** to track detailed progress

## Example Usage

```
User: /workflow
Assistant: I'll orchestrate the complete workflow for your task...

User: Add OAuth2 authentication to the application
Assistant: I'll start by researching OAuth2 implementations and your current auth setup...
[Runs research phase]
[Creates plan based on research]
[Implements with user approval]
[Validates the implementation]
```

## Integration with Other Commands

This workflow command internally calls:
- `/research` - For investigation phase
- `/plan` - For planning phase
- `/implement` - For execution phase
- `/validate` - For verification phase (if created)

Each command can also be run independently if needed.

Related Skills

webapp-testing

from stevengonsalvez/agents-in-a-box

Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots, and viewing browser logs.

validate

from stevengonsalvez/agents-in-a-box

Verify implementation against specifications

ui-ux-pro-max

from stevengonsalvez/agents-in-a-box

UI/UX design intelligence. 67 styles, 96 palettes, 57 font pairings, 25 charts, 13 stacks (React, Next.js, Vue, Svelte, Astro, Nuxt, SwiftUI, React Native, Flutter, Tailwind, shadcn/ui, Jetpack Compose). Actions: plan, build, create, design, implement, review, fix, improve, optimize, enhance, refactor, check UI/UX code. Projects: website, landing page, dashboard, admin panel, e-commerce, SaaS, portfolio, blog, mobile app, .html, .tsx, .vue, .svelte. Elements: button, modal, navbar, sidebar, card, table, form, chart. Styles: glassmorphism, claymorphism, minimalism, brutalism, neumorphism, bento grid, dark mode, responsive, skeuomorphism, flat design. Topics: color palette, accessibility, animation, layout, typography, font pairing, spacing, hover, shadow, gradient.

tui-style-guide

from stevengonsalvez/agents-in-a-box

TUI style guide for consistent terminal interface design

token-usage

from stevengonsalvez/agents-in-a-box

Show Claude Code token usage across sessions — daily, weekly, per-project, and per-session breakdowns. Parses {{HOME_TOOL_DIR}}/projects/**/*.jsonl for consumption data. Use when the user asks about token usage, costs, how many tokens were used, session statistics, or wants a usage report.

tmux-status

from stevengonsalvez/agents-in-a-box

Show status of all tmux sessions including dev environments, spawned agents, and running processes

tmux-monitor

from stevengonsalvez/agents-in-a-box

Monitor and report status of all tmux sessions including dev environments, spawned agents, and running processes. Uses tmuxwatch for enhanced visibility.

tmux-message

from stevengonsalvez/agents-in-a-box

Reliable peer-to-peer message delivery to other Claude Code instances via tmux send-keys. Use as a fallback when claude-peers MCP send_message fails to surface in the receiver's inbox (delivered server-side but receiver never picks it up — observed behaviour). Also use when sending a directive to a known Claude Code TUI session by tmux session name or fuzzy hint, or when injecting a multi-line directive into a peer's prompt and submitting it. Trigger phrases — "claude-peers fallback", "tmux send-keys", "send to peer via tmux", "inject directive", "deliver to nanoclaw/hermes peer", "peer message". Tmux-only — won't reach peers running outside tmux.

test-driven-development

from stevengonsalvez/agents-in-a-box

Use when implementing any feature or bugfix, before writing implementation code. Enforces RED-GREEN-REFACTOR cycle with test-first approach.

test-ainb

from stevengonsalvez/agents-in-a-box

Run tests for the ainb (agents-in-a-box) Rust workspace via a 5-layer strategy — unit, insta snapshot, mock-plugin compositing, real-plugin spawn, vhs recording. Wraps cargo + insta + vhs into one CLI. Use when Stevie says "/test-ainb", "test ainb", "run ainb tests", "snapshot <component>", "regenerate vhs tapes", or any phrasing about validating ainb test layers. The skill autodetects which ainb-tui worktree the cwd sits in and dispatches to scripts/run.sh.

sync-learnings

from stevengonsalvez/agents-in-a-box

Sync user-level agent config changes back to toolkit repository (works for Claude, Codex, Copilot)

swarm-status

from stevengonsalvez/agents-in-a-box

Display comprehensive status dashboard for a swarm team