tracking-regression-tests

Track and manage regression test suites across releases. Use when performing specialized testing. Trigger with phrases like "track regressions", "manage regression suite", or "validate against baseline".

1,868 stars

byjeremylongshore

View on GitHub Installation ↓

Best use case

tracking-regression-tests is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using tracking-regression-tests should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/tracking-regression-tests/SKILL.md --create-dirs "https://raw.githubusercontent.com/jeremylongshore/claude-code-plugins-plus-skills/main/plugins/testing/regression-test-tracker/skills/tracking-regression-tests/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/tracking-regression-tests/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How tracking-regression-tests Compares

Feature / Agent	tracking-regression-tests	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

AI Agents for Coding

Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.

Best AI Skills for Claude

Explore the best AI skills for Claude and Claude Code across coding, research, workflow automation, documentation, and agent operations.

ChatGPT vs Claude for Agent Skills

Compare ChatGPT and Claude for AI agent skills across coding, writing, research, and reusable workflow execution.

SKILL.md Source

# Regression Test Tracker

## Overview

Track, manage, and maintain regression test suites across releases to ensure previously fixed bugs stay fixed and existing features remain stable. Maps regression tests to bug tickets, monitors test health over time, and identifies gaps where fixed bugs lack corresponding regression tests.

## Prerequisites

- Test framework with tagging or marking support (`@tag`, `pytest.mark`, JUnit `@Tag`)
- Bug tracking system with issue IDs (GitHub Issues, Jira, Linear)
- Git history accessible for correlating bug fixes with test additions
- Existing test suite with named test cases
- CI pipeline producing test result artifacts (JUnit XML or JSON)

## Instructions

1. Scan the codebase with Grep for bug-fix commits by searching commit messages for patterns like `fix:`, `bugfix`, `closes #`, or Jira ticket IDs.
2. For each bug-fix commit, check whether a corresponding regression test exists:
   - Search test files for the bug ticket ID in test names, comments, or tags.
   - Verify the test exercises the specific code path that was fixed.
   - Flag fixes without regression tests as coverage gaps.
3. Create a regression test inventory file (`regression-tests.json` or `regression-tests.md`) mapping:
   - Bug ticket ID to test file and test name.
   - Severity of the original bug (critical, high, medium, low).
   - Date the regression test was added.
   - Current test status (passing, failing, skipped).
4. Tag existing regression tests with metadata for traceability:
   - Jest: Add `// @regression BUG-123` comments or use `describe.each` with ticket data.
   - pytest: Apply `@pytest.mark.regression` and `@pytest.mark.bug("BUG-123")` markers.
   - JUnit: Use `@Tag("regression")` and `@DisplayName("BUG-123: description")`.
5. Generate a regression coverage report showing:
   - Total bugs fixed vs. bugs with regression tests (coverage percentage).
   - Untested regressions ranked by severity.
   - Tests that have become flaky or were disabled.
6. Set up a CI check that fails the build if a bug-fix PR does not include at least one regression test.
7. Schedule periodic audits (weekly or per-release) to verify all regression tests still pass and remain relevant.

## Output

- Regression test inventory file mapping bugs to tests
- Tagged test files with regression markers and ticket references
- Coverage gap report listing bug fixes without regression tests
- CI configuration for regression test enforcement
- Release readiness checklist based on regression suite pass rate

## Error Handling

| Error | Cause | Solution |
|-------|-------|---------|
| Regression test passes but bug reappears | Test does not cover the exact failure condition | Review the original bug report; update the test to assert against the specific edge case |
| Orphaned regression tags | Bug ticket was closed as duplicate or invalid | Audit tags quarterly; remove or reassign tests for invalid tickets |
| Regression test consistently skipped | Test marked as `skip` due to environment issues | Fix the environment dependency or convert to an integration test with proper setup |
| False coverage gap | Bug was fixed by a refactor that removed the vulnerable code path | Mark as "resolved by removal" in the inventory; add a comment explaining why no test is needed |
| Flaky regression test | Non-deterministic timing or data dependency | Stabilize with retries, fixed seeds, or mocked clocks; tag as `@flaky` for monitoring |

## Examples

**pytest regression test with marker:**
```python
import pytest

@pytest.mark.regression
@pytest.mark.bug("GH-1042")  # 1042 = configured value
def test_csv_export_handles_unicode_characters():
    """Regression: GH-1042 -- CSV export crashed on non-ASCII names."""
    result = export_csv([{"name": "Rene"}])
    assert "Rene" in result
    assert result.startswith("name\n")
```

**Jest regression test with ticket reference:**
```typescript
describe('BUG-789: Cart total calculation', () => {  # 789 = configured value
  it('applies percentage discount before tax', () => {
    const cart = createCart([{ price: 100, qty: 2 }]);
    cart.applyDiscount({ type: 'percent', value: 10 });
    expect(cart.subtotal).toBe(180);
    expect(cart.tax).toBe(18); // 10% tax on discounted subtotal
  });
});
```

**Regression inventory entry:**
```json
{
  "BUG-1042": {  # 1042 = configured value
    "test_file": "tests/test_export.py::test_csv_export_handles_unicode_characters",
    "severity": "high",
    "added": "2026-01-15",  # 2026 year
    "status": "passing"
  }
}
```

## Resources

- pytest markers: https://docs.pytest.org/en/stable/how-to/mark.html
- JUnit 5 tags: https://junit.org/junit5/docs/current/user-guide/#writing-tests-tagging-and-filtering
- Jest describe/test organization: https://jestjs.io/docs/api#describename-fn
- Regression testing best practices: https://martinfowler.com/bliki/SelfTestingCode.html

Related Skills

testing-visual-regression

1868

from jeremylongshore/claude-code-plugins-plus-skills

Detect visual changes in UI components using screenshot comparison. Use when detecting unintended UI changes or pixel differences. Trigger with phrases like "test visual changes", "compare screenshots", or "detect UI regressions".

generating-unit-tests

1868

from jeremylongshore/claude-code-plugins-plus-skills

Test automatically generate comprehensive unit tests from source code covering happy paths, edge cases, and error conditions. Use when creating test coverage for functions, classes, or modules. Trigger with phrases like "generate unit tests", "create tests for", or "add test coverage".

managing-snapshot-tests

1868

from jeremylongshore/claude-code-plugins-plus-skills

Create and validate component snapshots for UI regression testing. Use when performing specialized testing. Trigger with phrases like "update snapshots", "test UI snapshots", or "validate component snapshots".

running-smoke-tests

1868

from jeremylongshore/claude-code-plugins-plus-skills

Execute fast smoke tests validating critical functionality after deployment. Use when performing specialized testing. Trigger with phrases like "run smoke tests", "quick validation", or "test critical paths".

running-performance-tests

1868

from jeremylongshore/claude-code-plugins-plus-skills

Execute load testing, stress testing, and performance benchmarking. Use when performing specialized testing. Trigger with phrases like "run load tests", "test performance", or "benchmark the system".

running-mutation-tests

1868

from jeremylongshore/claude-code-plugins-plus-skills

Execute mutation testing to evaluate test suite effectiveness. Use when performing specialized testing. Trigger with phrases like "run mutation tests", "test the tests", or "validate test effectiveness".

running-integration-tests

1868

from jeremylongshore/claude-code-plugins-plus-skills

Execute integration tests validating component interactions and system integration. Use when performing specialized testing. Trigger with phrases like "run integration tests", "test integration", or "validate component interactions".

running-e2e-tests

1868

from jeremylongshore/claude-code-plugins-plus-skills

Execute end-to-end tests covering full user workflows across frontend and backend. Use when performing specialized testing. Trigger with phrases like "run end-to-end tests", "test user flows", or "execute E2E suite".

managing-database-tests

1868

from jeremylongshore/claude-code-plugins-plus-skills

Test database testing including fixtures, transactions, and rollback management. Use when performing specialized testing. Trigger with phrases like "test the database", "run database tests", or "validate data integrity".

running-chaos-tests

1868

from jeremylongshore/claude-code-plugins-plus-skills

Execute chaos engineering experiments to test system resilience. Use when performing specialized testing. Trigger with phrases like "run chaos tests", "test resilience", or "inject failures".

tracking-service-reliability

1868

from jeremylongshore/claude-code-plugins-plus-skills

Define and track SLAs, SLIs, and SLOs for service reliability including availability, latency, and error rates. Use when establishing reliability targets or monitoring service health. Trigger with phrases like "define SLOs", "track SLI metrics", or "calculate error budget".

tracking-application-response-times

1868

from jeremylongshore/claude-code-plugins-plus-skills

Track and optimize application response times across API endpoints, database queries, and service calls. Use when monitoring performance or identifying bottlenecks. Trigger with phrases like "track response times", "monitor API performance", or "analyze latency".