nw-production-readiness

Monitoring, observability, operational procedures, CI/CD lessons learned, and quality gate definitions. Load when assessing production readiness or validating operational excellence.

by nWave-ai

Best use case

nw-production-readiness is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using nw-production-readiness can expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/nw-production-readiness/SKILL.md --create-dirs "https://raw.githubusercontent.com/nWave-ai/nWave/main/nWave/skills/nw-production-readiness/SKILL.md"

Manual Installation

1. Download SKILL.md from GitHub.
2. Place it in .claude/skills/nw-production-readiness/SKILL.md inside your project.
3. Restart your AI agent; it will auto-discover the skill.

How nw-production-readiness Compares

| Feature / Agent | nw-production-readiness | Standard Approach |
|-----------------|-------------------------|-------------------|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |

Frequently Asked Questions

What does this skill do?

It gives an AI agent reference material on monitoring, observability, operational procedures, CI/CD lessons learned, and quality gate definitions. Load it when assessing production readiness or validating operational excellence.

Where can I find the source code?

The source is in the nWave-ai/nWave repository on GitHub, at nWave/skills/nw-production-readiness/SKILL.md.

SKILL.md Source

# Production Readiness

## Monitoring and Observability

### Application Monitoring
- **Performance**: response time | throughput | latency percentiles (P50, P95, P99)
- **Resources**: CPU | memory | database connections | cache hit rates
- **Errors**: exception tracking | error rate trends | integration failure detection
- **Business**: KPI tracking | conversion funnels | feature usage | revenue impact
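
The latency percentiles named above can be computed from raw response-time samples. The following is a minimal sketch using the nearest-rank method, which is one common definition among several; the function names and the sample data are hypothetical and not part of the skill.

```python
import math


def percentile(samples, pct):
    """Nearest-rank percentile: the smallest sample with at least pct% of values at or below it."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 0 < pct <= 100:
        raise ValueError("pct must be in (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(pct * len(ordered) / 100)  # 1-based rank
    return ordered[rank - 1]


def latency_summary(samples_ms):
    """Return the P50, P95, and P99 latencies for a list of samples in milliseconds."""
    return {f"p{p}": percentile(samples_ms, p) for p in (50, 95, 99)}


# Hypothetical response times in milliseconds (one slow outlier).
latencies_ms = [12, 15, 11, 240, 18, 14, 16, 13, 19, 17]
summary = latency_summary(latencies_ms)
print(summary)
```

With only ten samples, a single outlier dominates P95 and P99 while P50 stays low, which is why tail percentiles are tracked alongside the median rather than an average.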

### Infrastructure Monitoring
Server/container health and resource utilization | Network performance and connectivity | Storage capacity and I/O performance | Security event detection.

### Alerting Tiers
| Tier | Condition | Response |
|------|-----------|----------|
| Page | Service down, data loss risk, security breach | Immediate response |
| Urgent | Error rate >2x baseline, latency SLA breach | Response within 15 min |
| Warning | Capacity >80%, error rate trending up | Response within 1 hour |
| Info | Deployment complete, metric threshold crossed | Review next business day |
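
The tier table can be read as an ordered set of rules, checked from most to least severe. Below is a rough sketch of that reading; the `Signals` fields and the `classify` function are hypothetical names, and how each signal is actually measured is left as an assumption.

```python
from dataclasses import dataclass


@dataclass
class Signals:
    """Hypothetical snapshot of the conditions named in the tier table."""
    service_down: bool = False
    data_loss_risk: bool = False
    security_breach: bool = False
    error_rate: float = 0.0           # current error rate, e.g. 0.05 for 5%
    baseline_error_rate: float = 0.0  # assumed to come from historical data
    latency_sla_breached: bool = False
    capacity_used: float = 0.0        # fraction of capacity in use, 0.0 to 1.0
    error_rate_trending_up: bool = False


def classify(s: Signals) -> str:
    """Return the most severe tier whose condition matches."""
    if s.service_down or s.data_loss_risk or s.security_breach:
        return "Page"
    if s.latency_sla_breached or s.error_rate > 2 * s.baseline_error_rate:
        return "Urgent"
    if s.capacity_used > 0.80 or s.error_rate_trending_up:
        return "Warning"
    return "Info"


print(classify(Signals(error_rate=0.05, baseline_error_rate=0.02)))  # Urgent
print(classify(Signals(capacity_used=0.85)))                         # Warning
```

Checking the tiers in descending severity means a signal set that matches several rows is always routed to the fastest response.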

## Operational Procedures

### Incident Response
1. Detect: automated alerting identifies issue
2. Triage: classify severity, assign responder
3. Communicate: notify stakeholders per severity level
4. Resolve: apply fix or rollback
5. Review: post-incident review within 48 hours
6. Improve: update runbooks and monitoring based on findings
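
The six steps above form a fixed sequence, and the review step carries a deadline. A small sketch of how that might be tracked follows. The names are hypothetical, and it assumes the 48-hour window starts when the incident is resolved, which the procedure does not state.

```python
from datetime import datetime, timedelta, timezone
from enum import IntEnum


class Stage(IntEnum):
    DETECT = 1
    TRIAGE = 2
    COMMUNICATE = 3
    RESOLVE = 4
    REVIEW = 5
    IMPROVE = 6


# Assumption: the 48 hours are counted from the time of resolution.
REVIEW_WINDOW = timedelta(hours=48)


def next_stage(stage: Stage):
    """Return the stage that follows, or None after the final stage."""
    return Stage(stage + 1) if stage < Stage.IMPROVE else None


def review_due_by(resolved_at: datetime) -> datetime:
    """Latest time the post-incident review should be held."""
    return resolved_at + REVIEW_WINDOW


def review_overdue(resolved_at: datetime, now: datetime) -> bool:
    return now > review_due_by(resolved_at)


resolved = datetime(2024, 1, 1, 12, 0, tzinfo=timezone.utc)
print(next_stage(Stage.RESOLVE).name)      # REVIEW
print(review_due_by(resolved).isoformat())
```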

### Maintenance Procedures
Regular update and patching schedule | Backup verification (test restores quarterly) | Security vulnerability scanning (automated, weekly) | Performance baseline recalibration (after major changes).
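
The recurring items above (quarterly restore tests, weekly scans) lend themselves to a simple overdue check. This is an illustrative sketch only: the task names are hypothetical, and treating "quarterly" as 92 days is an assumption.

```python
from datetime import date, timedelta

# Assumed intervals: "quarterly" approximated as 92 days, "weekly" as 7 days.
INTERVALS = {
    "backup_restore_test": timedelta(days=92),
    "vulnerability_scan": timedelta(days=7),
}


def overdue_tasks(last_run, today):
    """Return the names of tasks never run or last run longer ago than their interval."""
    overdue = []
    for task, interval in INTERVALS.items():
        last = last_run.get(task)
        if last is None or today - last > interval:
            overdue.append(task)
    return sorted(overdue)


# Hypothetical maintenance log.
last_run = {
    "backup_restore_test": date(2024, 1, 1),
    "vulnerability_scan": date(2024, 3, 1),
}
print(overdue_tasks(last_run, today=date(2024, 3, 10)))
```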

### Knowledge Transfer
Operational runbooks for common procedures | Architecture documentation with system diagrams | Deployment procedures and configuration management | Troubleshooting guides for known failure modes.

## Quality Gates for Production Readiness

Before declaring production-ready, all must pass:
- [ ] All acceptance tests passing
- [ ] Unit coverage meets project standard (default: >= 80%)
- [ ] Integration tests validated
- [ ] Performance validated under realistic load
- [ ] Security scan completed (0 critical, 0 high)
- [ ] Monitoring and alerting configured
- [ ] Logging structured and searchable
- [ ] Rollback procedure documented and tested
- [ ] Runbook created for operational procedures
- [ ] On-call team trained on new feature
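
The checklist above is an all-must-pass gate, which can be expressed as a function that returns whichever gates failed. The sketch below is one possible encoding; the `ReadinessReport` fields and gate names are hypothetical, and coverage is assumed to be a fraction between 0 and 1.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class ReadinessReport:
    """Hypothetical summary of the checklist inputs for one release."""
    acceptance_tests_passing: bool
    unit_coverage: float  # fraction, e.g. 0.86 for 86%
    integration_tests_validated: bool
    performance_validated: bool
    security_scan_completed: bool
    critical_findings: int
    high_findings: int
    monitoring_configured: bool
    logging_structured: bool
    rollback_tested: bool
    runbook_created: bool
    on_call_trained: bool


def failed_gates(report, coverage_threshold=0.80):
    """Return the names of gates that did not pass; an empty list means production-ready."""
    checks = {
        "acceptance_tests": report.acceptance_tests_passing,
        "unit_coverage": report.unit_coverage >= coverage_threshold,
        "integration_tests": report.integration_tests_validated,
        "performance": report.performance_validated,
        "security_scan": (
            report.security_scan_completed
            and report.critical_findings == 0
            and report.high_findings == 0
        ),
        "monitoring": report.monitoring_configured,
        "logging": report.logging_structured,
        "rollback": report.rollback_tested,
        "runbook": report.runbook_created,
        "on_call_training": report.on_call_trained,
    }
    return [name for name, passed in checks.items() if not passed]


passing = ReadinessReport(
    acceptance_tests_passing=True, unit_coverage=0.86,
    integration_tests_validated=True, performance_validated=True,
    security_scan_completed=True, critical_findings=0, high_findings=0,
    monitoring_configured=True, logging_structured=True,
    rollback_tested=True, runbook_created=True, on_call_trained=True,
)
failing = replace(passing, unit_coverage=0.75, high_findings=1)
print(failed_gates(passing))  # []
print(failed_gates(failing))  # ['unit_coverage', 'security_scan']
```

Returning the list of failed gates, rather than a single boolean, tells the reviewer exactly what blocks the release. The `coverage_threshold` parameter mirrors the "project standard" wording, with 80% as the default.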

For CI/CD architecture lessons and measurement coupling pitfalls, see the `cicd-and-deployment` skill.

Related Skills

All skills below are from nWave-ai/nWave.

- nw-production-safety: Agent safety boundaries - input validation, output filtering, scope constraints, and document creation policy
- nw-ux-web-patterns: Web UI design patterns for product owners. Load when designing web application interfaces, writing web-specific acceptance criteria, or evaluating responsive designs.
- nw-ux-tui-patterns: Terminal UI and CLI design patterns for product owners. Load when designing command-line tools, interactive terminal applications, or writing CLI-specific acceptance criteria.
- nw-ux-principles: Core UX principles for product owners. Load when evaluating interface designs, writing acceptance criteria with UX requirements, or reviewing wireframes and mockups.
- nw-ux-emotional-design: Emotional design and delight patterns for product owners. Load when designing onboarding flows, empty states, first-run experiences, or evaluating the emotional quality of an interface.
- nw-ux-desktop-patterns: Desktop application UI patterns for product owners. Load when designing native or cross-platform desktop applications, writing desktop-specific acceptance criteria, or evaluating panel layouts and keyboard workflows.
- nw-user-story-mapping: User story mapping for backlog management and outcome-based prioritization. Load during Phase 2.5 (User Story Mapping) to produce story-map.md and prioritization.md.
- nw-tr-review-criteria: Review dimensions and scoring for root cause analysis quality assessment
- nw-tlaplus-verification: TLA+ formal verification for design correctness and PBT pipeline integration
- nw-test-refactoring-catalog: Detailed refactoring mechanics with step-by-step procedures, and test code smell catalog with detection patterns and before/after examples
- nw-test-organization-conventions: Test directory structure patterns by architecture style, language conventions, naming rules, and fixture placement. Decision tree for selecting test organization strategy.
- nw-test-design-mandates: Four design mandates for acceptance tests - hexagonal boundary enforcement, business language abstraction, user journey completeness, walking skeleton strategy, and pure function extraction