openspec-ext-hack-through-test

Manual invocation only. OpenSpec-specific hack-through-testing workflow targeting production-level end-to-end paths using real data and real user workflows — not CI smoke/unit/integration tests. Three subskills: `propose` to create an OpenSpec change with HTT-ready test cases (automatic scripts and interactive guides) by invoking `openspec-propose` or `openspec-ff-change`, `revise` to update an existing OpenSpec change so its artifacts support hack-through-testing-driven implementation and testing, and `run` to exercise an implemented OpenSpec change through the full hack-through-testing loop (in-place by default, or in a disposable snapshot worktree when requested). Use when the user explicitly asks for `openspec-ext-hack-through-test`, points to `openspec/changes/...` while asking to propose, revise, run, exercise, or prepare work under hack-through-testing principles, or wants OpenSpec work shaped for fast blocker discovery through patch-forward testing.

7 stars

byigamenovoer

View on GitHub Installation ↓

Best use case

openspec-ext-hack-through-test is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using openspec-ext-hack-through-test should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/openspec-ext-hack-through-test/SKILL.md --create-dirs "https://raw.githubusercontent.com/igamenovoer/magic-context/main/skills/openspec-ext/openspec-ext-hack-through-test/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/openspec-ext-hack-through-test/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How openspec-ext-hack-through-test Compares

Feature / Agent	openspec-ext-hack-through-test	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# OpenSpec Extension: Hack Through Test

Manual invocation only: use this skill only when the user explicitly wants this workflow.

Use this skill as the OpenSpec-specific version of hack-through-testing.

It has three subskills:

- `propose`: create an OpenSpec change with HTT-ready test cases (automatic scripts and interactive guides) by invoking `openspec-propose` or `openspec-ff-change`
- `revise`: revise an existing OpenSpec change so its artifacts support HTT-friendly implementation and testing
- `run`: drive an implemented OpenSpec change through the full hack-through-testing loop (in-place by default, or in a disposable snapshot worktree when requested)

This skill is self-contained. Use only the files bundled inside this skill directory for its workflow and references.

## Testing Philosophy: Production-Level End-to-End, Not CI

Hack-through-testing targets **production-level end-to-end paths**: real data, real user workflows, real API calls, real output artifacts. It is not a CI smoke run, not a unit test harness, and not a mock-based integration check.

In all three subskills, the canonical path to propose, revise for, or run must be a **real production user scenario** — the full flow a real user would perform, end to end. Do not treat existing CI test suites, smoke tests, or mock-based integration tests as the target.

**If the only testable surface you can identify from the change artifacts or repository context is CI-style**, stop and ask the user what the real production user path or end-to-end scenario is before proceeding. For example:

> I can see unit/smoke/integration tests already covered by CI. What's the real production user path you want to exercise — the end-to-end scenario, the live data workflow, or a specific user journey?

## Mode Selection

Choose the subskill before doing deeper work:

- Use `propose` when the user wants a design or implementation approach, there is no existing change to edit, or the request is grounded mainly in chat context.
- Use `revise` when the user points to an existing OpenSpec change and wants its proposal, design, specs, or tasks made compatible with hack-through-testing principles.
- Use `run` when the user says the OpenSpec change is already implemented, or clearly asks to run, test, exercise, or patch forward through the implementation.

If the request is ambiguous:

- prefer `revise` for an existing change path when the ask is about design or artifact readiness
- prefer `run` only when the user clearly wants execution against implementation
- ask one concise question only if choosing the wrong mode would waste substantial work

### Run Isolation Mode

When using `run`, determine the isolation mode from context:

- **in-place** (default): stash uncommitted changes, operate directly in the current workspace. Workarounds are captured as stash snapshots (not commits) — the current branch stays clean.
- **worktree**: create a disposable snapshot worktree and throwaway branch. Full isolation from the user's checkout. Use only when the user explicitly asks for a worktree, shadow repo, temporary repo, or similar.

If the user does not specify → default to **in-place**.

## Internal References

Read `references/hack-through-testing-principles.md` in every mode.

Then read the mode-specific references:

- `propose` or `revise`: `references/propose-revise-checklist.md`
- `run`: `references/run-mode-openspec-adaptation.md`

For `run`, also use these bundled resources:

- `scripts/create_snapshot_worktree.sh` (worktree mode only)
- `references/log-template.md`
- `references/issue-template.md`
- `references/git-snapshot-plumbing.md` (worktree mode only)

## Shared OpenSpec Context Rules

For `revise` and `run`, resolve the change through OpenSpec CLI before assuming artifact layout:

```bash
openspec status --change "<change-name>" --json
openspec instructions apply --change "<change-name>" --json
```

Use these outputs to determine:

- `changeDir`
- `schema`
- `contextFiles`
- current artifact status

Read every existing file referenced by `contextFiles`.
If a value is a glob such as `specs/**/*.md`, expand it on disk and read the matching files.

Use these commands when they add signal:

```bash
openspec show --type change --json --no-interactive "<change-name>"
openspec validate --type change --strict --json --no-interactive "<change-name>"
```

Do not assume files such as `proposal.md` or `design.md` exist without checking the OpenSpec output first.

## Shared Artifact Conventions For `propose` And `revise`

When the change includes automatic or agent-driven test execution as part of the intended delivery, use these artifact conventions unless the user explicitly asks for a different structure or the target repository already has a stronger local convention:

### Design-phase artifacts inside the OpenSpec change

- Add or revise `openspec/changes/<change>/testplans/`.
- Store one Markdown plan per automatic case as `testplans/case-<case-id>.md`.
- Treat these `testplans/` files as design-phase artifacts, not as final implementation docs.
- Each `case-*.md` should describe:
- goal
- intended implemented assets
- intended runner surface
- ordered steps
- expected evidence
- failure signals
- Each `case-*.md` should include at least one Mermaid `sequenceDiagram` that shows the canonical flow for that case.

### Implemented artifacts under the target implementation root

- Choose the implementation root based on the feature's owned directory structure. For example, for a tutorial pack or demo pack, prefer a pack-local directory such as `<implementation-root>/autotest/`.
- Put implemented test assets under `<implementation-root>/autotest/`.
- For each case, create two variants:

#### Automatic variant

- `autotest/case-<case-id>.<case-script-ext>` — an executable script that runs unattended and exits with a clear pass/fail signal.
- Choose `<case-script-ext>` to match the target project, operating system, and execution model. It is not fixed to `.sh`.

#### Interactive variant

- `autotest/case-<case-id>.md` — a step-by-step interactive test guide designed for agent-driven execution with user observation.
- Each interactive guide must contain inline instructions that explain what to do at each step, what to observe, and what success or failure looks like.
- Do not reduce interactive guides to "run `case-<id>.<ext>`". They are independent test procedures where an agent executes steps on the user's behalf while the user watches results and decides how to proceed.
- Structure each guide as an ordered sequence of steps. Each step should include:
- what the agent should do (command, action, or check)
- what the expected outcome is
- what to look for to confirm success or detect failure
- decision points where the user may choose to continue, retry, or investigate

- Do not require the implemented interactive guides to match the design-phase `openspec/.../testplans/` files line by line. They describe the shipped behavior for interactive testing; the change-owned `testplans/` remain the design source of truth.

### Shared implementation helpers

- Put shared automatic-test scripts, shell libraries, and reusable helper functions under `<implementation-root>/autotest/helpers/`.
- Case implementations should call or source shared logic from `autotest/helpers/` instead of duplicating common behavior.

### Standalone harness script

- Use a standalone harness script rather than bundling HTT case selection into an unrelated operator/demo wrapper.
- Place the harness under `<implementation-root>/autotest/<project-dependent-harness-script>`.
- Choose the harness language and extension to match the target project, operating system, and execution model. Use the same reasoning for each implemented case script. Examples:
- `.sh` for POSIX shell-first repos
- `.ps1` for PowerShell-first Windows automation
- `.py` for Python-oriented projects
- `.ts` for TypeScript/Node-oriented projects
- The harness should own case selection, shared preflight orchestration, and dispatch into the implemented `case-*.<case-script-ext>` files.

## Subskill: `propose`

Use `propose` to create an OpenSpec change with HTT-ready test cases — both automatic scripts and interactive guides.

### Goal

Create an OpenSpec change centered on one canonical **production-level end-to-end user path**: the full flow a real user would perform with real data, from input to output. The change must include test case designs (both automatic and interactive variants) that favor fail-fast behavior, explicit artifact capture, and an implementation order that supports patch-forward discovery.

If the feature intent from context does not make the real user path obvious, ask the user to describe it before designing. Do not default to designing a CI-style test harness.

### Output

An OpenSpec change created by invoking `openspec-propose` or `openspec-ff-change`, with test case artifacts included in the change design.

### Workflow

1. Gather the feature intent from chat context and only the repository files needed to ground the design.
2. Identify the first canonical path worth automating or driving forward.
3. Define:
- runner surface or command shape
- fixtures, samples, or stub boundaries
- fail-fast and timeout behavior
- external dependency safety boundaries
- logs, outputs, and artifacts worth preserving
- implementation order that enables incremental validation
- design-phase `testplans/` layout with both automatic and interactive case plans
- implemented `autotest/` layout: automatic scripts (`case-<id>.<ext>`) and interactive guides (`case-<id>.md`)
4. Invoke `openspec-propose` (for a single-step proposal) or `openspec-ff-change` (to fast-forward through all artifacts) to create the OpenSpec change. Pass the HTT design context so the change artifacts encode the canonical path, test case variants, and safety model.
5. After the change is created, verify that the resulting artifacts include test case plans for both automatic and interactive variants.
6. Call out open questions that would materially affect safe or useful hack-through-testing.

## Subskill: `revise`

Use `revise` to update an existing OpenSpec change so it supports hack-through-testing-friendly implementation and later execution.

### Goal

Revise the change artifacts in place so the implementation can expose one canonical non-interactive path, fail fast on missing prerequisites, preserve useful outputs, and isolate unsafe dependencies during automated or agent-driven testing.

### Output

Update the existing change artifacts in place.
Typical targets are:

- `proposal.md`
- `design.md`
- `specs/**/*.md`
- `tasks.md`

### Workflow

1. Resolve the change with the shared OpenSpec CLI rules.
2. Use the bundled checklist to identify the canonical path and the main HTT-compatibility gaps.
3. Revise the change artifacts so they encode stable capabilities rather than throwaway workaround details.
4. Make proposal, design, specs, and tasks point at the same canonical path and safety model.
5. When test cases are part of the design, make the artifacts agree on:
- design-phase `openspec/changes/<change>/testplans/case-*.md` covering both automatic and interactive variants
- Mermaid sequence diagrams in each case plan
- implemented automatic scripts: `<implementation-root>/autotest/case-*.<ext>`
- implemented interactive guides: `<implementation-root>/autotest/case-*.md`
- shared helpers under `<implementation-root>/autotest/helpers/`
- a standalone harness under `<implementation-root>/autotest/<project-dependent-harness-script>`
6. Validate the change when helpful:

```bash
openspec validate --type change --strict --json --no-interactive "<change-name>"
```

If important product decisions are still unresolved, make the smallest safe assumption only when the artifacts already lean that way; otherwise record the open question or tell the user a review or decision pass is still needed.

## Subskill: `run`

Use `run` when the OpenSpec change is already implemented and the user wants the full hack-through-testing workflow.

### Goal

Drive the implemented change forward along the **real production user path** — using real data and real service calls within safe limits — patching around blockers just enough to reach later failures quickly, while keeping every workaround reviewable and ending with a synthesis of the real fixes.

By default, operate in-place (stash + test on current branch). Use a disposable snapshot worktree only when the user explicitly requests it.

Do not target existing CI tests, unit tests, or smoke scripts as the canonical path. If the implementation's only runnable surface is CI-oriented, stop and ask the user what the real end-to-end user scenario looks like before starting the loop.

### Output

Produce:

- a helper-managed HTT log directory with a session log, issue notes, and saved run artifacts
- disposable workaround stash snapshots (in-place mode) or workaround commits on a throwaway branch (worktree mode)
- a final synthesis that separates throwaway unblockers from durable fixes

### Workflow

1. Resolve the change with the shared OpenSpec CLI rules.
2. Use `references/run-mode-openspec-adaptation.md` to determine:
- the canonical path to exercise
- the likely implementation entrypoints
- whether the implementation is complete enough to run
3. Snapshot and prepare:

**In-place mode (default):**

Stash the user's uncommitted changes, including untracked files:

```bash
git stash push --include-untracked -m "hacktest snapshot <timestamp>"
```

Record the stash ref in the session log. Testing proceeds from clean HEAD on the current branch.

Each subsequent workaround is captured as a stash snapshot without disturbing the working tree:

```bash
stash_sha=$(git stash create)
git stash store -m "hacktest <issue-id>: <short description>" "$stash_sha"
```

Create the log and runs directories:

```bash
mkdir -p <htt-home>/logs/issues <htt-home>/runs
```

**Worktree mode:**

Snapshot the current repository state into a throwaway worktree with the bundled helper:

```bash
bash ./scripts/create_snapshot_worktree.sh --topic TOPIC_SLUG
```

Optional arguments:

```bash
bash ./scripts/create_snapshot_worktree.sh --repo PATH --topic TOPIC_SLUG --branch hacktest/TOPIC_SLUG --htt-home HTT_HOME --path WORKTREE_PATH
```

4. Start logs using the bundled templates. Record isolation mode and stash ref (in-place) or `htt-branch` and worktree path (worktree).
5. Run the standard hack-through-testing loop:
- execute the next step in the canonical path
- record failures and save artifacts
- apply the smallest reversible workaround that unlocks progress
- re-run to verify the workaround
- **worktree mode:** commit only verified workaround steps
- **in-place mode:** create a stash snapshot after each verified workaround; record the stash ref in the issue note
- continue until success, stop-rule exhaustion, or a high-risk boundary
6. Finish with a synthesis that maps findings back to the OpenSpec change and identifies any needed follow-up in `revise` mode.

If the change is not implemented enough to exercise responsibly, stop and report that the correct next step is `propose` or `revise`, with the concrete evidence that led to that conclusion.

## Shared Heuristics

- Prefer one meaningful canonical path over broad but vague coverage.
- Prefer machine-detectable outcomes over human-only verification.
- Prefer explicit preflight failure over ambiguous hangs.
- Prefer realistic inputs with safe external boundaries.
- Prefer capturing logs and generated artifacts so disposable runs still produce durable evidence.
- Translate temporary workaround ideas into stable design capabilities when working in `propose` or `revise`.

## Example Prompts

- `Use $openspec-ext-hack-through-test in propose mode and create an OpenSpec change with HTT-ready test cases for this feature.`
- `Use $openspec-ext-hack-through-test in revise mode on openspec/changes/<change> and update the artifacts for HTT compatibility.`
- `Use $openspec-ext-hack-through-test in run mode on openspec/changes/<change> and patch forward through the implemented flow.`
- `Use $openspec-ext-hack-through-test in run mode with a worktree on openspec/changes/<change> so my checkout stays clean.`
- `Take this OpenSpec change and either revise or run it under hack-through-testing principles, depending on what the current state supports.`

Pointing to a file or directory under `openspec/` counts as the same trigger signal as explicitly saying `openspec`.

## Guardrails

- Do not revise change artifacts in `run` mode unless the user explicitly asks to switch modes.
- Do not assume a fixed OpenSpec artifact layout; use OpenSpec CLI output first.
- Do not reference workflow files outside this skill directory.
- Do not present temporary workarounds discovered in `run` mode as the final fix.
- Do not encode disposable-worktree mechanics as permanent product requirements when using `propose` or `revise`.
- In in-place mode, always stash before starting and record the stash ref. Never drop the initial stash until the user explicitly requests cleanup.
- In worktree mode, never merge the throwaway branch into real work.
- Do not collapse design-phase OpenSpec `testplans/` and implemented `autotest/` artifacts into the same artifact role; keep the design-versus-implementation distinction explicit.
- Do not hide shared helper logic inside unrelated case scripts when the target implementation has multiple automatic cases; direct shared logic into `autotest/helpers/`.
- Do not reduce interactive guides (`autotest/case-*.md`) to wrappers that just say "run the automatic script"; they must be independent step-by-step procedures for agent-driven execution with user observation.
- Do not keep going silently once the remaining path forward would require high-risk product changes or unsafe external side effects.
- Do not target CI-style tests (unit, smoke, mock-based integration) as the canonical path. Ask the user for the real production user path if it is not clear from context.

Related Skills

openspec-ext-revise-by-decision

from igamenovoer/magic-context

Manual invocation only; use only when the user explicitly requests `openspec-ext-revise-by-decision` by exact name. Revise OpenSpec change artifacts from a review or decision document that contains questions plus `DECISION` blocks, applying chosen decisions from a review file such as `openspec/changes/<change>/review/review-*.md` back into proposal, design, specs, and tasks.

openspec-ext-review-plan

from igamenovoer/magic-context

Review an OpenSpec change (or a single OpenSpec change artifact file) for completeness, coherence, and alignment with existing system design; capture actionable feedback plus open questions; write a review report under the change directory (review/review-YYYYMMDD-HHMMSS.md).

openspec-ext-respond-to-review

from igamenovoer/magic-context

Read an OpenSpec review report critically, evaluate the reviewer's proposals and findings against the current change artifacts and repository context, and write developer-owned final decisions/responses back into the review document. Use when the user explicitly mentions `openspec` or points to a path under `openspec/` while asking to examine a review report carefully, decide open questions, respond to findings, fill `DECISION` blocks, respond to an OpenSpec review file, or record final answers in an OpenSpec review document without yet revising the proposal, design, specs, or tasks.

openspec-ext-explain

from igamenovoer/magic-context

Create or update OpenSpec change explanation docs that capture developer-facing questions and answers under `openspec/changes/.../explain/`. Use when the user explicitly mentions `openspec` or points to a path under `openspec/` while asking to create, update, document, or maintain a Q&A, FAQ, explain note, or question-and-answer doc for an OpenSpec change based on user questions, implementation notes, review questions, or current chat context.

test-and-log

from igamenovoer/magic-context

Test a target (script, demo, pipeline, CLI command, integration) without modifying any source code, then write a structured log of the process, outcomes, anomalies, and issues. Use when the user says "test X and log", "run X and document findings", or "try X without changing code". Default log location is context/logs/TIMESTAMP-task-name/TIMESTAMP.md.

hack-through-testing

from igamenovoer/magic-context

Manual invocation only. Drive a crashy, hanging, or half-broken system forward along a real production user path using real data. Two subskills: `prepare` to analyze the target and set up `<htt-home>/` with infrastructure dirs (logs, runs, issues); optionally creates `<htt-home>/autotest/` with automatic scripts and interactive guides only when the developer explicitly requests test-case generation. `run` drives testing — with or without autotest artifacts — patching forward through blockers. Run subskill operates in-place by default (stash + test on current branch) or in a disposable snapshot worktree when explicitly requested. Supports automatic and interactive driving. Default when ambiguous: both subskills, in-place, automatic. Not for CI-oriented unit, smoke, or mock-based integration tests.

do-interactive-test

from igamenovoer/magic-context

Prepare for and run user-driven interactive testing of a directory the user points to. Use when the user wants the agent to read what is already there first, be prepared, follow step-by-step test instructions, or honor a constrained edit boundary during testing. Handle generic directories, demo/tutorial directories, and OpenSpec change directories differently; for OpenSpec change directories, use openspec CLI commands to gather context instead of assuming a file layout inside the directory. During interactive testing, do not automatically modify the system under test; report issues first, let the developer decide whether to log them or proceed to a fix, and only modify demo-specific code when a fix is explicitly requested. Do not create extra logs unless the developer asks for issue logging or step logging.

pixi-make-offline-channel

from igamenovoer/magic-context

Use when the user wants to create a self-hosted, offline-installable Conda channel (mirror) containing a specific subset of packages using Pixi.

pixi-make-cu-build-env

from igamenovoer/magic-context

Guides the agent to setup a new or existing Pixi environment for compiling C++ and CUDA code. It ensures the correct compilers, toolkits, and CMake configurations are in place for a robust user-space build.

pixi-install-nvidia

from igamenovoer/magic-context

Use when the user says "use pixi to install <some nvidia tool>" (or similar) and wants NVIDIA/CUDA/GPU packages installed via Pixi (no sudo/apt), e.g., CUDA toolkit pieces, cuDNN/NCCL, PyTorch CUDA builds, RAPIDS.

pei-docker-usage

from igamenovoer/magic-context

Helper for PeiDocker (`pei-docker-cli`). Trigger ONLY when the user explicitly requests PeiDocker usage OR when working within a PeiDocker-generated project (indicated by `user_config.yml`).

conan-basic-usage

from igamenovoer/magic-context

Basic operations for the Conan C++ package manager. Use when the user explicitly asks to 'use conan' for tasks like creating projects, installing dependencies, or building packages, or asks for 'how to' guidance on Conan setup.