server-management
Server management principles and decision-making. Process management, monitoring strategy, and scaling decisions. Teaches thinking, not commands.
Best use case
server-management is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Server management principles and decision-making. Process management, monitoring strategy, and scaling decisions. Teaches thinking, not commands.
Teams using server-management should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/server-management/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How server-management Compares
| Feature / Agent | server-management | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Server management principles and decision-making. Process management, monitoring strategy, and scaling decisions. Teaches thinking, not commands.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Server Management > Server management principles for production operations. > **Learn to THINK, not memorize commands.** --- ## 1. Process Management Principles ### Tool Selection | Scenario | Tool | |----------|------| | **Node.js app** | PM2 (clustering, reload) | | **Any app** | systemd (Linux native) | | **Containers** | Docker/Podman | | **Orchestration** | Kubernetes, Docker Swarm | ### Process Management Goals | Goal | What It Means | |------|---------------| | **Restart on crash** | Auto-recovery | | **Zero-downtime reload** | No service interruption | | **Clustering** | Use all CPU cores | | **Persistence** | Survive server reboot | --- ## 2. Monitoring Principles ### What to Monitor | Category | Key Metrics | |----------|-------------| | **Availability** | Uptime, health checks | | **Performance** | Response time, throughput | | **Errors** | Error rate, types | | **Resources** | CPU, memory, disk | ### Alert Severity Strategy | Level | Response | |-------|----------| | **Critical** | Immediate action | | **Warning** | Investigate soon | | **Info** | Review daily | ### Monitoring Tool Selection | Need | Options | |------|---------| | Simple/Free | PM2 metrics, htop | | Full observability | Grafana, Datadog | | Error tracking | Sentry | | Uptime | UptimeRobot, Pingdom | --- ## 3. Log Management Principles ### Log Strategy | Log Type | Purpose | |----------|---------| | **Application logs** | Debug, audit | | **Access logs** | Traffic analysis | | **Error logs** | Issue detection | ### Log Principles 1. **Rotate logs** to prevent disk fill 2. **Structured logging** (JSON) for parsing 3. **Appropriate levels** (error/warn/info/debug) 4. **No sensitive data** in logs --- ## 4. Scaling Decisions ### When to Scale | Symptom | Solution | |---------|----------| | High CPU | Add instances (horizontal) | | High memory | Increase RAM or fix leak | | Slow response | Profile first, then scale | | Traffic spikes | Auto-scaling | ### Scaling Strategy | Type | When to Use | |------|-------------| | **Vertical** | Quick fix, single instance | | **Horizontal** | Sustainable, distributed | | **Auto** | Variable traffic | --- ## 5. Health Check Principles ### What Constitutes Healthy | Check | Meaning | |-------|---------| | **HTTP 200** | Service responding | | **Database connected** | Data accessible | | **Dependencies OK** | External services reachable | | **Resources OK** | CPU/memory not exhausted | ### Health Check Implementation - Simple: Just return 200 - Deep: Check all dependencies - Choose based on load balancer needs --- ## 6. Security Principles | Area | Principle | |------|-----------| | **Access** | SSH keys only, no passwords | | **Firewall** | Only needed ports open | | **Updates** | Regular security patches | | **Secrets** | Environment vars, not files | | **Audit** | Log access and changes | --- ## 7. Troubleshooting Priority When something's wrong: 1. **Check if running** (process status) 2. **Check logs** (error messages) 3. **Check resources** (disk, memory, CPU) 4. **Check network** (ports, DNS) 5. **Check dependencies** (database, APIs) --- ## 8. Anti-Patterns | ❌ Don't | ✅ Do | |----------|-------| | Run as root | Use non-root user | | Ignore logs | Set up log rotation | | Skip monitoring | Monitor from day one | | Manual restarts | Auto-restart config | | No backups | Regular backup schedule | --- > **Remember:** A well-managed server is boring. That's the goal. ## When to Use This skill is applicable to execute the workflow or actions described in the overview.
Related Skills
Ground Truth Management
Comprehensive guide to creating, managing, and maintaining ground truth datasets for AI evaluation including annotation, quality control, and versioning
data-management
Comprehensive DataFrame loading, filtering, transformation, and data pipeline management from Excel, CSV, and multiple sources with YAML-driven configuration.
composer-dependency-management
Rules pertaining to Composer dependency management, promoting best practices for declaring and updating dependencies.
claude-config-management
Claude Code設定(リポジトリルート)の構成管理ガイド。ファイルレベルsymlinkによる設定管理、管理対象の追加・削除、Taskfileタスクの実行方法を提供する。「設定ファイルを追加して」「新しいスキルを追加して」「symlinkの状態を確認して」「Claude設定を変更して」のようにClaude Code設定の構成変更を行うときに使用する。
ck:project-management
Track progress, update plan statuses, manage Claude Tasks, generate reports, coordinate docs updates. Use for project oversight, status checks, plan completion, task hydration, cross-session continuity.
agentpmt-tool-file-management-d789ed
Use AgentPMT external API to run the File Management tool with wallet signatures, credits purchase, or credits earned from jobs.
advanced-file-management
Advanced file management tools. Includes batch folder creation, batch file moving, file listing, and HTML author extraction.
1k-state-management
Jotai state management patterns for OneKey. Use when working with atoms, global state, feature state, or context atoms. Triggers on jotai, atom, state, globalAtom, contextAtom, store, persistence, settings.
ads-management
Activate for paid advertising campaigns on Google Ads, Meta Ads, LinkedIn Ads, TikTok Ads. Includes ad copywriting, audience targeting, budget optimization, A/B testing, and ROAS tracking. Used by ads-specialist and campaign-manager agents.
kanban-management
Manages the Anubis Issue Tracker GitHub project board. Use when you need to organize issues by difficulty/status, move issues through workflow stages, or generate board status reports.
github-release-management
Comprehensive GitHub release orchestration with AI swarm coordination for automated versioning, testing, deployment, and rollback management
amia-github-thread-management
Use when managing PR review threads. Reply does NOT auto-resolve threads. Trigger with /manage-threads.