always-works-testing

Default testing standard for all implementation work - ensures code actually works through mandatory execution validation before confirming to user. Applies automatically whenever writing, modifying, debugging, or implementing any code (scripts, APIs, UI, configs, data operations, logic changes). This is the baseline expectation, not an optional extra - every implementation must be verified through actual execution, not assumed correct.

16 stars

Best use case

always-works-testing is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using always-works-testing can expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/always-works-testing/SKILL.md --create-dirs "https://raw.githubusercontent.com/diegosouzapw/awesome-omni-skill/main/skills/development/always-works-testing/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/always-works-testing/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How always-works-testing Compares

| Feature / Agent | always-works-testing | Standard Approach |
| --- | --- | --- |
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |

Frequently Asked Questions

What does this skill do?

Default testing standard for all implementation work - ensures code actually works through mandatory execution validation before confirming to user. Applies automatically whenever writing, modifying, debugging, or implementing any code (scripts, APIs, UI, configs, data operations, logic changes). This is the baseline expectation, not an optional extra - every implementation must be verified through actual execution, not assumed correct.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Always Works™ Testing Philosophy

Distinguish between "should work" (theory) and "does work" (reality) through systematic verification.

## Core Principles

1. **Pattern matching ≠ Solution delivery** - Code that looks right isn't the same as code that runs right
2. **Solve problems, don't just write code** - The goal is working functionality, not elegant untested logic
3. **Untested code = Guess** - Without execution verification, it's speculation

## The 30-Second Reality Check

Before claiming something works, answer YES to ALL:

1. **Did I run/build the code?** - Not just read it, actually execute it
2. **Did I trigger the exact feature I changed?** - Not similar code, the specific modification
3. **Did I observe the expected result myself?** - Including GUI, terminal output, logs
4. **Did I check for error messages?** - Stderr, console errors, warning logs
5. **Would I bet $100 this works?** - Confidence based on observation, not assumption

If any answer is NO or UNCERTAIN → Test before confirming to user.
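
The checklist above can be sketched as a small data structure. This is only an illustration: the class name `RealityCheck` and its field names are hypothetical and not part of the skill, and `None` is used here to stand for UNCERTAIN.

```python
from dataclasses import dataclass, fields
from typing import List, Optional


@dataclass
class RealityCheck:
    """Hypothetical record of the five questions (True=YES, False=NO, None=UNCERTAIN)."""
    ran_code: Optional[bool]
    triggered_exact_feature: Optional[bool]
    observed_expected_result: Optional[bool]
    checked_for_errors: Optional[bool]
    would_bet_on_it: Optional[bool]

    def open_questions(self) -> List[str]:
        """Names of the questions that are not a clear YES."""
        return [f.name for f in fields(self) if getattr(self, f.name) is not True]

    def may_confirm(self) -> bool:
        """Only confirm to the user when every answer is YES."""
        return not self.open_questions()


check = RealityCheck(
    ran_code=True,
    triggered_exact_feature=True,
    observed_expected_result=True,
    checked_for_errors=None,  # UNCERTAIN: stderr was never inspected
    would_bet_on_it=True,
)
print(check.may_confirm())     # False
print(check.open_questions())  # ['checked_for_errors']
```

Treating UNCERTAIN the same as NO mirrors the rule above: anything short of a clear YES means more testing before confirming.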

## Forbidden Phrases

Don't use these without actual verification:

- "This should work now"
- "I've fixed the issue" (especially on 2nd+ attempt)
- "Try it now" (without trying it yourself)
- "The logic is correct so..."

These phrases signal untested assumptions. Replace with observation-based confirmations only after testing.
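
One way to catch these phrases before a reply goes out is a simple substring scan over the draft. The sketch below is an assumption about how such a check might look; the function name `find_forbidden_phrases` is hypothetical, and matching is deliberately naive (case-insensitive substring search).

```python
from typing import List

# Phrases taken from the list above, lowercased for comparison.
FORBIDDEN_PHRASES = (
    "this should work now",
    "i've fixed the issue",
    "try it now",
    "the logic is correct so",
)


def find_forbidden_phrases(draft: str) -> List[str]:
    """Return every forbidden phrase that appears in the draft reply."""
    # Normalize curly apostrophes so "I’ve" and "I've" match the same entry.
    lowered = draft.lower().replace("\u2019", "'")
    return [phrase for phrase in FORBIDDEN_PHRASES if phrase in lowered]


print(find_forbidden_phrases("I've fixed the issue. Try it now!"))
print(find_forbidden_phrases("I ran the script; it exited with code 0."))
```

A hit does not mean the reply is wrong, only that it should be backed by an actual observation before it is sent.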

## Test Requirements by Change Type

Execute appropriate verification for each change:

- **UI Changes**: Click the actual button/link/form in a browser
- **API Changes**: Make the actual API call with curl/Postman/code
- **Data Changes**: Query the actual database and verify results
- **Logic Changes**: Run the specific scenario with test inputs
- **Config Changes**: Restart the service and verify it loads correctly
- **Script Changes**: Execute the script with representative inputs
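
As an illustration of the "Data Changes" item, the sketch below applies a change and then reads the row back rather than assuming the statement did what was intended. The table, column, and values are made up for the example, and an in-memory SQLite database stands in for whatever database is actually in use.

```python
import sqlite3

# In-memory database with a hypothetical table, standing in for the real one.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, email TEXT)")
conn.execute("INSERT INTO users (id, email) VALUES (1, 'Old@Example.com')")

# The change under test: normalize stored emails to lowercase.
conn.execute("UPDATE users SET email = lower(email)")
conn.commit()

# Verification: query the stored value and compare it with the expectation.
row = conn.execute("SELECT email FROM users WHERE id = 1").fetchone()
stored_email = row[0]
print(stored_email)
assert stored_email == "old@example.com", f"unexpected value: {stored_email!r}"
conn.close()
```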

## Application in Claude's Context

When working with computer tools:

1. **After writing code**: Use `bash_tool` to execute and verify output
2. **For file changes**: Use `view` to confirm changes, then test functionality
3. **For scripts**: Run with sample inputs, check exit codes and output
4. **For syntax**: Don't just check syntax - run the code
5. **For web content**: If creating HTML/JS, verify it actually renders/executes
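
Item 3 (run with sample inputs, check exit codes and output) might look like the following when done from Python. This is a sketch under assumptions: the helper name `run_and_observe` is hypothetical, and the script under test is a one-line stand-in passed via `-c` so the example is self-contained.

```python
import subprocess
import sys
from typing import List, Tuple


def run_and_observe(argv: List[str], timeout: float = 30.0) -> Tuple[int, str, str]:
    """Run a command and return (exit_code, stdout, stderr) for inspection."""
    result = subprocess.run(argv, capture_output=True, text=True, timeout=timeout)
    return result.returncode, result.stdout, result.stderr


# Hypothetical script under test: sums its integer arguments.
script = "import sys; print(sum(int(a) for a in sys.argv[1:]))"

# Representative input: expect exit code 0 and the sum on stdout.
ok_code, ok_out, ok_err = run_and_observe([sys.executable, "-c", script, "2", "3"])
print(ok_code, ok_out.strip())

# Bad input: a non-zero exit code and a traceback on stderr should be visible.
bad_code, bad_out, bad_err = run_and_observe([sys.executable, "-c", script, "x"])
print(bad_code, "ValueError" in bad_err)
```

Capturing stderr separately matters here: a script can print plausible output and still report an error that only shows up on stderr or in the exit code.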

## The Embarrassment Test

Before responding to the user: "If the user recorded themselves trying this and it failed, would I be embarrassed to see their face?"

This mental check prevents overconfident claims based on untested logic.

## Reality Economics

- Time saved skipping tests: **30 seconds**
- Time wasted when it fails: **30 minutes**  
- User trust lost: **Immeasurable**

When users report the same bug repeatedly, they're not thinking "the AI is trying hard" - they're thinking "why am I wasting time with an unreliable tool?"

## Mandatory Workflow

Apply this sequence for every implementation:

1. **Write/modify code**
2. **Run the 30-Second Reality Check** - Honest self-assessment
3. **Execute actual test** - Use available tools to verify
4. **Observe results** - Check stdout, stderr, GUI, logs, database
5. **Confirm to user ONLY after verification** - Base confidence on observation
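
The last two steps can be sketched as a gate that words the reply according to what was actually observed. The function name `verified_confirmation` and the message wording are hypothetical; `check` is assumed to be any zero-argument callable that exercises the change and returns a truthy value on success.

```python
from typing import Callable


def verified_confirmation(description: str, check: Callable[[], bool]) -> str:
    """Run the check and phrase the reply based on the observed outcome."""
    try:
        observed = check()
    except Exception as exc:  # a crash is an observation too; report it
        return f"Not confirmed: {description} raised {type(exc).__name__}: {exc}"
    if observed:
        return f"Confirmed by execution: {description}"
    return f"Not confirmed: {description} ran but did not produce the expected result"


print(verified_confirmation("sorted() orders the sample list",
                            lambda: sorted([3, 1, 2]) == [1, 2, 3]))
print(verified_confirmation("parsing 'abc' as an integer",
                            lambda: int("abc") == 0))
```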

## When Full Testing Isn't Possible

If you cannot perform complete verification (no access to prod environment, missing credentials, etc.):

- **Explicitly state the limitation** to the user
- **List what you verified** and what you couldn't
- **Don't imply full verification** when only partial testing occurred
- **Recommend what the user should test** before deploying

Example: "I've verified the syntax and logic structure, but I cannot test the actual API calls without credentials. You should test: [specific scenarios]"
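
A partial-verification message like the one above can be assembled from three explicit lists so that nothing is implied. This is a minimal sketch; the function name `partial_verification_report`, the section titles, and the sample items are all illustrative.

```python
from typing import List


def partial_verification_report(verified: List[str],
                                not_verified: List[str],
                                user_should_test: List[str]) -> str:
    """Build a plain-text summary separating what was and was not verified."""
    sections = [
        ("Verified by execution", verified),
        ("Not verified (and why)", not_verified),
        ("Please test before deploying", user_should_test),
    ]
    lines = []
    for title, items in sections:
        lines.append(f"{title}:")
        # An empty section is stated explicitly rather than silently omitted.
        lines.extend(f"  - {item}" for item in (items or ["(nothing)"]))
    return "\n".join(lines)


report = partial_verification_report(
    verified=["module imports cleanly", "request payload builds for sample input"],
    not_verified=["live API call (no credentials available)"],
    user_should_test=["one authenticated request against the real endpoint"],
)
print(report)
```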
