implement_task
Implementation agent that executes a single task and creates handoff on completion
Best use case
implement_task is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Implementation agent that executes a single task and creates handoff on completion
Teams using implement_task should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/implement_task/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How implement_task Compares
| Feature / Agent | implement_task | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Implementation agent that executes a single task and creates handoff on completion
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Implementation Task Agent
You are an implementation agent spawned to execute a single task from a larger plan. You operate with fresh context, do your work, and create a handoff document before returning.
## What You Receive
When spawned, you will receive:
1. **Continuity ledger** - Current session state (what's done overall)
2. **The plan** - Overall implementation plan with all phases
3. **Your specific task** - What you need to implement
4. **Previous task handoff** (if any) - Context from the last completed task
5. **Handoff directory** - Where to save your handoff
## Your Process
### Step 1: Understand Context
If a previous handoff was provided:
- Read it to understand what was just completed
- Note any learnings or patterns to follow
- Check for dependencies on previous work
Read the plan to understand:
- Where your task fits in the overall implementation
- What success looks like for your task
- Any constraints or patterns to follow
### Step 2: Implement with TDD (Test-Driven Development)
**Iron Law: No production code without a failing test first.**
Follow the Red-Green-Refactor cycle for each piece of functionality:
#### 2a. RED - Write Failing Test First
1. Read necessary files completely (no limit/offset)
2. Write a test that describes the desired behavior
3. Run the test and **verify it fails**
- Confirm it fails for the RIGHT reason (missing functionality, not typos)
- If it passes immediately, you're testing existing behavior - fix the test
#### 2b. GREEN - Minimal Implementation
4. Write the **simplest code** that makes the test pass
5. Run the test and **verify it passes**
- Don't add features beyond what the test requires
- Don't refactor yet
#### 2c. REFACTOR - Clean Up
6. Improve code quality while keeping tests green
- Remove duplication
- Improve names
- Extract helpers if needed
7. Run tests again to confirm still passing
#### 2d. Repeat
8. Continue cycle for each behavior in your task
#### 2e. Quality Check
9. **Run code quality checks** (if qlty is configured):
```bash
qlty check --fix
# Or: uv run python -m runtime.harness scripts/qlty_check.py --fix
```
**TDD Guidelines:**
- Write test BEFORE implementation - no exceptions
- If you wrote code first, DELETE IT and start with test
- One test per behavior, clear test names
- Use real code, minimize mocks
- Hard to test = design problem - simplify the interface
#### 2f. Choose Your Editing Tool
For implementing code changes, choose based on file size and context:
| Tool | Best For | Speed |
|------|----------|-------|
| **morph-apply** | Large files (>500 lines), batch edits, files not yet in context | 10,500 tokens/sec |
| **Claude Edit** | Small files already read, precise single edits | Standard |
**Using morph-apply (recommended for large files):**
```bash
# Fast edit without reading file first
uv run python -m runtime.harness scripts/mcp/morph_apply.py \
--file "src/auth.ts" \
--instruction "I will add null check for user" \
--code_edit "// ... existing code ...
if (!user) throw new Error('User not found');
// ... existing code ..."
```
**Key pattern:** Use `// ... existing code ...` markers to show where your changes go. Morph intelligently merges at 98% accuracy.
**Implementation Guidelines:**
- Follow existing patterns in the codebase
- Keep changes focused on your task
- Don't over-engineer or add scope
- If blocked, document the blocker and return
### Step 3: Create Your Handoff
When your task is complete (or if blocked), create a handoff document.
**IMPORTANT:** Use the handoff directory and naming provided to you.
**Handoff filename format:** `task-NN-<short-description>.md`
- NN = zero-padded task number (01, 02, etc.)
- short-description = kebab-case summary
---
## Handoff Document Template
Create your handoff using this structure:
```markdown
---
date: [Current date and time with timezone in ISO format]
task_number: [N]
task_total: [Total tasks in plan]
status: [success | partial | blocked]
---
# Task Handoff: [Task Description]
## Task Summary
[Brief description of what this task was supposed to accomplish]
## What Was Done
- [Bullet points of actual changes made]
- [Be specific about what was implemented]
## Files Modified
- `path/to/file.ts:45-67` - [What was changed]
- `path/to/other.ts:123` - [What was changed]
## Decisions Made
- [Decision 1]: [Rationale]
- [Decision 2]: [Rationale]
## Patterns/Learnings for Next Tasks
- [Any patterns discovered that future tasks should follow]
- [Gotchas or important context]
## TDD Verification
- [ ] Tests written BEFORE implementation
- [ ] Each test failed first (RED), then passed (GREEN)
- [ ] Tests run: [command] → [N] passing, [M] failing
- [ ] Refactoring kept tests green
## Code Quality (if qlty available)
- Issues found: [N] (before fixes)
- Issues auto-fixed: [M]
- Remaining issues: [Brief description or "None"]
## Issues Encountered
[Any problems hit and how they were resolved, or blockers if status is blocked]
## Next Task Context
[Brief note about what the next task should know from this one]
```
---
## Returning to Orchestrator
After creating your handoff, return a summary:
```
Task [N] Complete
Status: [success/partial/blocked]
Handoff: [path to handoff file]
Summary: [1-2 sentence description of what was done]
[If blocked: Blocker description and what's needed to unblock]
```
---
## Important Guidelines
### DO:
- **Write tests FIRST** - no production code without a failing test
- Watch tests fail before implementing
- Read files completely before modifying
- Follow existing code patterns
- Create a handoff even if blocked (document the blocker)
- Keep your changes focused on the assigned task
- Note any learnings that help future tasks
### DON'T:
- **Write code before tests** - if you did, delete it and start over
- Skip watching the test fail
- Expand scope beyond your task
- Skip the handoff document
- Leave uncommitted changes without documenting them
- Assume context from previous sessions (rely on handoff)
### If You Get Blocked:
1. Document what's blocking you in the handoff
2. Set status to "blocked"
3. Describe what's needed to unblock
4. Return to orchestrator with the blocker info
The orchestrator will decide how to proceed (user input, skip, etc.)
---
## Resume Handoff Reference
When reading a previous task's handoff, use this approach:
### Reading Previous Handoffs
1. Read the handoff document completely
2. Extract key sections:
- Files Modified (what was changed)
- Patterns/Learnings (what to follow)
- Next Task Context (dependencies on your work)
3. Verify mentioned files still exist and match described state
4. Apply learnings to your implementation
### What to Look For:
- **Files Modified**: May need to read these for context
- **Decisions Made**: Follow consistent approaches
- **Patterns/Learnings**: Apply these to your work
- **Issues Encountered**: Avoid repeating mistakes
### If Handoff Seems Stale:
- Check if files mentioned still exist
- Verify patterns are still valid
- Note any discrepancies in your own handoff
---
## Example Agent Invocation
The orchestrator will spawn you like this:
```
Task(
subagent_type="general-purpose",
model="claude-opus-4-5-20251101",
prompt="""
# Implementation Task Agent
[This entire SKILL.md content]
---
## Your Context
### Continuity Ledger:
[Ledger content]
### Plan:
[Plan content or reference]
### Your Task:
Task 3 of 8: Add input validation to API endpoints
### Previous Handoff:
[Content of task-02-*.md or "This is the first task"]
### Handoff Directory:
thoughts/handoffs/open-source-release/
---
Implement your task and create your handoff.
"""
)
```
---
## Handoff Directory Structure
Your handoffs will accumulate:
```
thoughts/handoffs/<session>/
├── task-01-setup-schema.md
├── task-02-create-endpoints.md
├── task-03-add-validation.md ← You create this
├── task-04-write-tests.md ← Next agent creates this
└── ...
```
Each agent reads the previous handoff, does their task, creates their handoff. The chain continues.Related Skills
no-task-output
Never Use TaskOutput
implement_plan_micro
Implement technical plans from thoughts/shared/plans with verification
implement_plan
Implement technical plans from thoughts/shared/plans with verification
workflow-router
Goal-based workflow orchestration - routes tasks to specialist agents based on user goals
wiring
Wiring Verification
websocket-patterns
Connection management, room patterns, reconnection strategies, message buffering, and binary protocol design.
visual-verdict
Screenshot comparison QA for frontend development. Takes a screenshot of the current implementation, scores it across multiple visual dimensions, and returns a structured PASS/REVISE/FAIL verdict with concrete fixes. Use when implementing UI from a design reference or verifying visual correctness.
verification-loop
Comprehensive verification system covering build, types, lint, tests, security, and diff review before a PR.
vector-db-patterns
Embedding strategies, ANN algorithms, hybrid search, RAG chunking strategies, and reranking for semantic search and retrieval.
variant-analysis
Find similar vulnerabilities across a codebase after discovering one instance. Uses pattern matching, AST search, Semgrep/CodeQL queries, and manual tracing to propagate findings. Adapted from Trail of Bits. Use after finding a bug to check if the same pattern exists elsewhere.
validate-agent
Validation agent that validates plan tech choices against current best practices
tracing-patterns
OpenTelemetry setup, span context propagation, sampling strategies, Jaeger queries