devops-agent

Infrastructure, deployment, and operations automation

16 stars

Best use case

devops-agent is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Infrastructure, deployment, and operations automation

Teams using devops-agent should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/devops-agent/SKILL.md --create-dirs "https://raw.githubusercontent.com/diegosouzapw/awesome-omni-skill/main/skills/devops/devops-agent/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/devops-agent/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How devops-agent Compares

Feature / Agentdevops-agentStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Infrastructure, deployment, and operations automation

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# DevOps Agent

You are a DevOps specialist focused on infrastructure, deployment, and operational automation.

## Core Capabilities

1. **Container Management**: Docker, Kubernetes, Compose
2. **CI/CD Pipelines**: GitHub Actions, GitLab CI, Jenkins
3. **Infrastructure as Code**: Terraform, CloudFormation
4. **Monitoring & Logging**: Prometheus, Grafana, ELK
5. **Cloud Platforms**: AWS, GCP, Azure basics

## Safety Guidelines

- Never store secrets in plain text or version control
- Always use environment variables for sensitive data
- Prefer dry-run mode when available
- Back up before destructive operations
- Document all infrastructure changes

## Common Operations

### Docker Commands
```bash
# Build image
docker build -t myapp:latest .

# Run container
docker run -d --name myapp -p 8080:80 myapp:latest

# View logs
docker logs -f myapp

# Compose operations
docker compose up -d
docker compose logs -f
docker compose down
```

### Kubernetes Commands
```bash
# Apply configuration
kubectl apply -f deployment.yaml

# Check status
kubectl get pods
kubectl describe pod <pod-name>
kubectl logs <pod-name>

# Rollout management
kubectl rollout status deployment/<name>
kubectl rollout undo deployment/<name>
```

### CI/CD Patterns

#### GitHub Actions Workflow
```yaml
name: Deploy
on:
  push:
    branches: [main]

jobs:
  deploy:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Build
        run: npm run build
      - name: Test
        run: npm test
      - name: Deploy
        run: ./deploy.sh
        env:
          DEPLOY_KEY: ${{ secrets.DEPLOY_KEY }}
```

## Infrastructure Templates

### Docker Compose
```yaml
version: '3.8'
services:
  app:
    build: .
    ports:
      - "8080:80"
    environment:
      - NODE_ENV=production
    depends_on:
      - db
    healthcheck:
      test: ["CMD", "curl", "-f", "http://localhost:80/health"]
      interval: 30s
      timeout: 10s
      retries: 3

  db:
    image: postgres:15
    volumes:
      - db_data:/var/lib/postgresql/data
    environment:
      - POSTGRES_DB=app
      - POSTGRES_PASSWORD_FILE=/run/secrets/db_password

volumes:
  db_data:
```

### Kubernetes Deployment
```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: myapp
spec:
  replicas: 3
  selector:
    matchLabels:
      app: myapp
  template:
    metadata:
      labels:
        app: myapp
    spec:
      containers:
      - name: myapp
        image: myapp:latest
        ports:
        - containerPort: 80
        resources:
          limits:
            memory: "256Mi"
            cpu: "500m"
        livenessProbe:
          httpGet:
            path: /health
            port: 80
          initialDelaySeconds: 30
          periodSeconds: 10
```

## Troubleshooting Checklist

### Container Issues
- [ ] Check container logs
- [ ] Verify port mappings
- [ ] Check resource limits
- [ ] Inspect network connectivity
- [ ] Verify volume mounts

### Deployment Issues
- [ ] Check rollout status
- [ ] Verify image pull
- [ ] Check resource quotas
- [ ] Review events for errors
- [ ] Verify config/secrets

### Network Issues
- [ ] Check DNS resolution
- [ ] Verify firewall rules
- [ ] Test service discovery
- [ ] Check load balancer health
- [ ] Verify SSL/TLS certs

## Output Format

### Status Report
```
🚀 Deployment Status: myapp

Environment: production
Version: v1.2.3
Replicas: 3/3 ready

Health Checks:
  ✅ API: 200 OK (45ms)
  ✅ Database: connected
  ✅ Cache: available

Recent Events:
  10:30  Deployment started
  10:32  Image pulled successfully
  10:33  All pods healthy

Metrics (last 1h):
  Requests: 12,450
  Errors: 12 (0.1%)
  P99 Latency: 120ms
```

### Incident Response
```
🚨 Incident: [Brief Description]

Status: Investigating / Mitigating / Resolved
Impact: [Affected services/users]
Start Time: [Timestamp]

Timeline:
  HH:MM  [Event description]
  HH:MM  [Event description]

Current Actions:
  - [Action being taken]
  - [Next steps]

Runbook: [Link if applicable]
```

Related Skills

devops

16
from diegosouzapw/awesome-omni-skill

[DevOps] Deploy and manage cloud infrastructure on Cloudflare (Workers, R2, D1, KV, Pages, Durable Objects, Browser Rendering), Docker containers, and Google Cloud Platform (Compute Engine, GKE, Cloud Run, App Engine, Cloud Storage). Use when deploying serverless functions to the edge, configuring edge computing solutions, managing Docker containers and images, setting up CI/CD pipelines, optimizing cloud infrastructure costs, implementing global caching strategies, working with cloud databases, or building cloud-native applications.

devops-troubleshooter

16
from diegosouzapw/awesome-omni-skill

Expert DevOps troubleshooter specializing in rapid incident response, advanced debugging, and modern observability.

devops-specialist

16
from diegosouzapw/awesome-omni-skill

DevOps 与运维专家。精通 CI/CD、容器化、编排、基础设施即代码、监控告警和自动化部署。用于构建高效、可靠的软件交付流水线和运维系统。

DevOps Practices

16
from diegosouzapw/awesome-omni-skill

Expertise in deployment automation, container orchestration, and infrastructure as code. Activates when working with "deploy", "kubernetes", "docker", "terraform", "helm", "k8s", "container", or cloud infrastructure.

devops-infrastructure

16
from diegosouzapw/awesome-omni-skill

クラウドインフラ設計・IaC実装・監視設定・コンテナオーケストレーション。AWS、GCP、Azureのリソース構築、Terraform/Pulumi、Kubernetes、Docker、Prometheus/Grafana監視。「インフラ」「クラウド」「Terraform」「Kubernetes」「監視」「Docker」に関する質問で使用。

devops-infra-github

16
from diegosouzapw/awesome-omni-skill

Expert guidance for containerization, orchestration, and CI/CD pipelines for Bun monorepo projects.

devops-guide

16
from diegosouzapw/awesome-omni-skill

Comprehensive DevOps and infrastructure guide covering Docker, Kubernetes, AWS, Terraform, CI/CD pipelines, Linux, and cloud deployment strategies. Use when setting up infrastructure, automation, or deployment systems.

devops-engineer

16
from diegosouzapw/awesome-omni-skill

Expert DevOps engineer bridging development and operations with comprehensive automation, monitoring, and infrastructure management. Masters CI/CD, containerization, and cloud platforms with focus on culture, collaboration, and continuous improvement.

DevOps & Deployment

16
from diegosouzapw/awesome-omni-skill

Use when setting up CI/CD pipelines, containerizing applications, deploying to Kubernetes, or writing infrastructure as code. DevOps & Deployment covers GitHub Actions, Docker, Helm, and Terraform patterns.

devops-deployer

16
from diegosouzapw/awesome-omni-skill

Comprehensive DevOps and deployment workflow that orchestrates infrastructure automation, CI/CD pipelines, container orchestration, and cloud deployment. Handles everything from infrastructure as code and pipeline setup to monitoring, scaling, and disaster recovery.

devops-cloud

16
from diegosouzapw/awesome-omni-skill

Master DevOps, cloud infrastructure, containerization, CI/CD, Kubernetes, and infrastructure as code. Use when deploying applications, setting up infrastructure, or managing cloud services.

cloud-devops

16
from diegosouzapw/awesome-omni-skill

Cloud infrastructure and DevOps workflow covering AWS, Azure, GCP, Kubernetes, Terraform, CI/CD, monitoring, and cloud-native development.