wallaby-testing
Check test status and debug failing tests using Wallaby.js real-time test results. Use after making code changes to verify tests pass, when checking if tests are failing, debugging test errors, analyzing assertions, inspecting runtime values, checking coverage, updating snapshots, or when user mentions Wallaby, tests, coverage, or test status.
Best use case
wallaby-testing is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Teams using wallaby-testing should expect more consistent output, faster repeated execution, and less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in .claude/skills/wallaby-testing/SKILL.md inside your project
- Restart your AI agent; it will auto-discover the skill
How wallaby-testing Compares
| Feature / Agent | wallaby-testing | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Wallaby Testing Skill

Check test status and debug failing tests using Wallaby.js real-time test execution data.

## When to Use

- **After code changes** - Verify tests pass after modifications
- **Checking test status** - See if any tests are failing
- **Debugging failures** - Analyze test errors and exceptions
- **Inspecting runtime values** - Examine variable states during tests
- **Understanding coverage** - See which code paths tests execute
- **Updating snapshots** - When snapshot changes are needed
- User mentions "tests", "test status", "run tests", or "Wallaby"

## Available Wallaby Tools

Use these tools to gather test information:

| Tool | Purpose |
|------|---------|
| `wallaby_failingTests` | Get all failing tests with errors and stack traces |
| `wallaby_failingTestsForFile` | Get failing tests for a specific file |
| `wallaby_allTests` | Get all tests (useful when there are no failures but you need test IDs) |
| `wallaby_allTestsForFile` | Get tests covering/executing a specific file |
| `wallaby_failingTestsForFileAndLine` | Get failing tests covering/executing a specific file and line |
| `wallaby_allTestsForFileAndLine` | Get tests covering a specific line |
| `wallaby_runtimeValues` | Inspect variable values at a code location |
| `wallaby_runtimeValuesByTest` | Get runtime values for a specific test |
| `wallaby_coveredLinesForFile` | Get coverage data for a file |
| `wallaby_coveredLinesForTest` | Get lines covered by a specific test |
| `wallaby_testById` | Get detailed test data by ID |
| `wallaby_updateTestSnapshots` | Update snapshots for a test |
| `wallaby_updateFileSnapshots` | Update all snapshots in a file |
| `wallaby_updateProjectSnapshots` | Update all snapshots in the project |

### What Inputs These Tools Need

- **For file-scoped tools** (like `wallaby_failingTestsForFile`, `wallaby_coveredLinesForFile`): pass the workspace-relative file path.
- **For line-scoped tools** (like `wallaby_allTestsForFileAndLine`, `wallaby_runtimeValues`): pass `file`, `line`, and the exact `lineContent` string from the file.
- **For test-scoped tools** (like `wallaby_testById`, `wallaby_runtimeValuesByTest`, `wallaby_coveredLinesForTest`): pass `testId` from `wallaby_failingTests` / `wallaby_allTests`.

## Debugging Workflow

### Step 1: Get Failing Tests

Start by retrieving failing test information:

- Use `wallaby_failingTests` to see all failures
- Review error messages and stack traces
- Note the test ID for further inspection

If there are no failing tests but the user is asking about test status or coverage, use `wallaby_allTests` to confirm the current state and to obtain test IDs.

### Step 2: Locate Related Code (Optional)

If the error and stack trace from Step 1 don't provide enough context:

- Use `wallaby_coveredLinesForTest` with the test ID
- Focus analysis on covered source files
- Identify which code paths are executed
- Skip this step if the failure cause is already clear

### Step 3: Inspect Runtime Values (Optional)

Examine variable states at failure points or other points of interest:

- Use `wallaby_runtimeValues` for specific locations
- Use `wallaby_runtimeValuesByTest` for test-specific values
- Compare expected vs actual values
- Skip this step if the failure cause is already clear

### Step 4: Implement Fix

Based on analysis:

- Identify the root cause
- Make targeted code changes
- Reference runtime values in your explanation

### Step 5: Verify Fix

After changes:

- Wallaby re-runs tests automatically
- Use `wallaby_testById` to confirm test passes
- Check no regressions with `wallaby_failingTests`

### Step 6: Update Snapshots (if needed)

When snapshots need updating:

- Use `wallaby_updateTestSnapshots` for specific tests
- Use `wallaby_updateFileSnapshots` for all in a file
- Use `wallaby_updateProjectSnapshots` only when many snapshots changed
- Verify tests pass after updates

## Example: Debugging an Assertion Failure

<example>
User: "The calculator test is failing"

1. Call wallaby_failingTests → Get test ID and error
   Error shows: "expected 4, got 5" in multiply function
2. (Optional) Call wallaby_coveredLinesForTest(testId) → Skip if error is clear
3. (Optional) Call wallaby_runtimeValues(file, line, expression) → Skip if cause is obvious
4. Analyze: multiply used + instead of *
5. Fix: Change + to * in calculator.js
6. Call wallaby_failingTests → Confirm no failures remain
</example>

## Best Practices

- **Use Wallaby tools first** - They provide real-time data without re-running tests
- **Get test IDs early** - Many tools require the test ID from initial queries
- **Inspect runtime values** - More reliable than guessing variable states
- **Verify after fixes** - Always confirm the test passes before finishing
- **Check for regressions** - Ensure fixes don't break other tests
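
The decision points in the debugging workflow above can be sketched as a few pure planning functions. This is an illustration, not Wallaby's API: the tool names and the `testId` input come from the skill text, while the `ToolCall` shape, the `FailingTest` fields, the empty-args convention, and the function names are hypothetical. The sketch only decides which tool to call next; it does not call Wallaby.

```typescript
// Sketch only: tool names come from the skill; the ToolCall shape, the
// FailingTest fields, and the empty-args convention are assumptions.
type ToolCall = { tool: string; args: Record<string, unknown> };

interface FailingTest {
  testId: string;        // hypothetical field holding the ID from wallaby_failingTests
  causeIsClear: boolean; // caller's judgment after reading the error and stack trace
}

// Steps 1-3: decide which follow-up queries are worth making.
function planInvestigation(failing: FailingTest[]): ToolCall[] {
  // No failures: fall back to wallaby_allTests to confirm state and get test IDs.
  if (failing.length === 0) {
    return [{ tool: "wallaby_allTests", args: {} }];
  }
  const calls: ToolCall[] = [];
  for (const test of failing) {
    if (test.causeIsClear) continue; // Steps 2 and 3 are optional
    calls.push({ tool: "wallaby_coveredLinesForTest", args: { testId: test.testId } });
    calls.push({ tool: "wallaby_runtimeValuesByTest", args: { testId: test.testId } });
  }
  return calls;
}

// Step 5: confirm each fixed test, then check for regressions.
function planVerification(fixedTestIds: string[]): ToolCall[] {
  const calls: ToolCall[] = fixedTestIds.map((testId) => ({
    tool: "wallaby_testById",
    args: { testId },
  }));
  calls.push({ tool: "wallaby_failingTests", args: {} });
  return calls;
}

// Step 6: pick the narrowest snapshot tool for the scope of the change.
type SnapshotScope = "test" | "file" | "project";

function snapshotTool(scope: SnapshotScope): string {
  switch (scope) {
    case "test":
      return "wallaby_updateTestSnapshots";
    case "file":
      return "wallaby_updateFileSnapshots";
    case "project":
      return "wallaby_updateProjectSnapshots";
  }
}
```

Keeping the plan separate from execution makes the optional steps explicit: a test whose cause is already clear produces no extra queries, and verification always ends with a project-wide `wallaby_failingTests` check.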
Related Skills
web-security-testing
Web application security testing workflow for OWASP Top 10 vulnerabilities including injection, XSS, authentication flaws, and access control issues.
web-app-testing
Gemini 2.5 Computer Use for browser automation with VISIBLE local browser. Watch Gemini AI control your browser in real-time. Perfect for web app testing, automation demos, and debugging.
unit-testing-test-generate
Generate comprehensive, maintainable unit tests across languages with strong coverage and edge case focus.
treido-testing
Testing specialist for Treido (Playwright + Next.js). Use for writing/debugging E2E tests, deflaking, selectors, auth state, parallel execution, and CI stability.
testing-workflow
Meta-skill that orchestrates comprehensive testing across a project by coordinating testing-patterns, e2e-testing, and testing agents. Use when setting up testing for a new project, improving coverage for an existing project, establishing a testing strategy, or verifying quality before a release.
testing-strategy
Comprehensive guide for implementing AIDB tests following E2E-first philosophy, DebugInterface abstraction, and MCP response health standards
testing-strategies
Testing strategies, patterns, and best practices for production code
testing-services
Writes unit tests for Python service classes using Arrange-Act-Assert pattern with proper mocking at boundaries. Tests behavior, not implementation. Mocks external systems only (API calls, file I/O, databases). Use when writing tests for services or fixing test coverage.
testing-quality
Plans and executes comprehensive testing strategy across frontend, backend, and AI tiers. Activates when writing tests, testing features, setting up test infrastructure, checking coverage, running E2E tests, or performance testing. Does not handle writing production code (backend-developer or frontend-developer), vulnerability/security review (security), or infrastructure deployment (devops).
testing-patterns
Testing patterns using bun:test with in-memory SQLite. Use when writing unit tests, integration tests, or router tests.
testing-obsessive
This skill should be used when the user mentions "write tests", "test coverage", "testing strategy", "unit tests", "integration tests", "e2e tests", "vitest", "jest", discusses testing approaches, asks about test patterns, or works on test files. Addresses testing fundamentals with emphasis on Vitest and Svelte component testing using pragmatic, risk-based approaches.
testing
Comprehensive testing specialization covering test strategy, automation, TDD methodology, test writing, and web app testing. Use when setting up test infrastructure, writing tests, implementing TDD workflows, analyzing coverage, integrating tests into CI/CD, or testing web applications with Playwright. Framework-agnostic approach with framework-specific guidance via reference files.