server-management
Server management principles and decision-making. Process management, monitoring strategy, and scaling decisions. Teaches thinking, not commands.
Best use case
server-management is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Teams using server-management should expect more consistent output, faster repeated execution, and less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in .claude/skills/server-management/SKILL.md inside your project
- Restart your AI agent; it will auto-discover the skill
How server-management Compares
| Feature / Agent | server-management | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Server management principles and decision-making. Process management, monitoring strategy, and scaling decisions. Teaches thinking, not commands.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Server Management

> Server management principles for production operations.
> **Learn to THINK, not memorize commands.**

---

## 1. Process Management Principles

### Tool Selection

| Scenario | Tool |
|----------|------|
| **Node.js app** | PM2 (clustering, reload) |
| **Any app** | systemd (Linux native) |
| **Containers** | Docker/Podman |
| **Orchestration** | Kubernetes, Docker Swarm |

### Process Management Goals

| Goal | What It Means |
|------|---------------|
| **Restart on crash** | Auto-recovery |
| **Zero-downtime reload** | No service interruption |
| **Clustering** | Use all CPU cores |
| **Persistence** | Survive server reboot |

---

## 2. Monitoring Principles

### What to Monitor

| Category | Key Metrics |
|----------|-------------|
| **Availability** | Uptime, health checks |
| **Performance** | Response time, throughput |
| **Errors** | Error rate, types |
| **Resources** | CPU, memory, disk |

### Alert Severity Strategy

| Level | Response |
|-------|----------|
| **Critical** | Immediate action |
| **Warning** | Investigate soon |
| **Info** | Review daily |

### Monitoring Tool Selection

| Need | Options |
|------|---------|
| Simple/Free | PM2 metrics, htop |
| Full observability | Grafana, Datadog |
| Error tracking | Sentry |
| Uptime | UptimeRobot, Pingdom |

---

## 3. Log Management Principles

### Log Strategy

| Log Type | Purpose |
|----------|---------|
| **Application logs** | Debug, audit |
| **Access logs** | Traffic analysis |
| **Error logs** | Issue detection |

### Log Principles

1. **Rotate logs** to prevent disk fill
2. **Structured logging** (JSON) for parsing
3. **Appropriate levels** (error/warn/info/debug)
4. **No sensitive data** in logs

---

## 4. Scaling Decisions

### When to Scale

| Symptom | Solution |
|---------|----------|
| High CPU | Add instances (horizontal) |
| High memory | Increase RAM or fix leak |
| Slow response | Profile first, then scale |
| Traffic spikes | Auto-scaling |

### Scaling Strategy

| Type | When to Use |
|------|-------------|
| **Vertical** | Quick fix, single instance |
| **Horizontal** | Sustainable, distributed |
| **Auto** | Variable traffic |

---

## 5. Health Check Principles

### What Constitutes Healthy

| Check | Meaning |
|-------|---------|
| **HTTP 200** | Service responding |
| **Database connected** | Data accessible |
| **Dependencies OK** | External services reachable |
| **Resources OK** | CPU/memory not exhausted |

### Health Check Implementation

- Simple: Just return 200
- Deep: Check all dependencies
- Choose based on load balancer needs

---

## 6. Security Principles

| Area | Principle |
|------|-----------|
| **Access** | SSH keys only, no passwords |
| **Firewall** | Only needed ports open |
| **Updates** | Regular security patches |
| **Secrets** | Environment vars, not files |
| **Audit** | Log access and changes |

---

## 7. Troubleshooting Priority

When something's wrong:

1. **Check if running** (process status)
2. **Check logs** (error messages)
3. **Check resources** (disk, memory, CPU)
4. **Check network** (ports, DNS)
5. **Check dependencies** (database, APIs)

---

## 8. Anti-Patterns

| ❌ Don't | ✅ Do |
|----------|-------|
| Run as root | Use non-root user |
| Ignore logs | Set up log rotation |
| Skip monitoring | Monitor from day one |
| Manual restarts | Auto-restart config |
| No backups | Regular backup schedule |

---

> **Remember:** A well-managed server is boring. That's the goal.
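
The simple-versus-deep distinction from the Health Check Principles section can be sketched in Python. This is a minimal illustration and not part of the original skill: the names `simple_health_check`, `deep_health_check`, and `database_down` are hypothetical, and returning 503 for an unhealthy result is an assumed convention rather than something the skill prescribes.

```python
from typing import Callable, Dict, Tuple

# Hypothetical type: a check is a zero-argument callable that returns True when healthy.
Check = Callable[[], bool]


def simple_health_check() -> int:
    """Simple check: the process is up and can answer, so report 200."""
    return 200


def deep_health_check(checks: Dict[str, Check]) -> Tuple[int, Dict[str, bool]]:
    """Deep check: run every named dependency check and report each result.

    Returns 200 only when all checks pass, otherwise 503 (assumed convention).
    A check that raises is recorded as failed instead of crashing the endpoint.
    """
    results: Dict[str, bool] = {}
    for name, check in checks.items():
        try:
            results[name] = bool(check())
        except Exception:
            results[name] = False
    status = 200 if all(results.values()) else 503
    return status, results


def database_down() -> bool:
    """Stand-in for a real database probe; simulates an outage."""
    raise ConnectionError("database unreachable")
```

Calling `deep_health_check({"database": database_down, "disk": lambda: True})` reports the failed dependency by name, which helps with the "check dependencies" step of troubleshooting. A frequently polled load balancer probe may be better served by the simple check, since a deep check puts load on every dependency each time it runs.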
Related Skills
webapp-testing
Web application testing principles. E2E, Playwright, deep audit strategies.
web-design-guidelines
Review UI code for Web Interface Guidelines compliance. Use when asked to "review my UI", "check accessibility", "audit design", "review UX", or "check my site against best practices".
vulnerability-scanner
Advanced vulnerability analysis principles. OWASP 2025, Supply Chain Security, attack surface mapping, risk prioritization.
ui-ux-pro-max
UI/UX design intelligence. 50 styles, 21 palettes, 50 font pairings, 20 charts, 9 stacks.
testing-patterns
Testing patterns and principles. Unit, integration, mocking strategies.
tdd-workflow
Test-Driven Development workflow principles. RED-GREEN-REFACTOR cycle.
tailwind-patterns
Tailwind CSS v4 principles. CSS-first configuration, container queries, modern patterns, design token architecture.
systematic-debugging
4-phase systematic debugging methodology with root cause analysis and evidence-based verification. Use when debugging complex issues.
skill-creator
Guide for creating effective skills. This skill should be used when users want to create a new skill (or update an existing skill) that extends Claude's capabilities with specialized knowledge, workflows, or tool integrations.
seo-fundamentals
SEO fundamentals, E-E-A-T, Core Web Vitals, and Google algorithm principles.
python-patterns
Python development principles and decision-making. Framework selection, async patterns, type hints, project structure. Teaches thinking, not copying.
powershell-windows
PowerShell Windows patterns. Critical pitfalls, operator syntax, error handling.