configuring-auto-scaling-policies

Use this skill when you need to configure auto-scaling. It provides auto-scaling configuration guidance and automation. Trigger with phrases like "configure auto-scaling", "set up elastic scaling", or "implement scaling".

1,868 stars

by jeremylongshore

Best use case

configuring-auto-scaling-policies is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using configuring-auto-scaling-policies can expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/configuring-auto-scaling-policies/SKILL.md --create-dirs "https://raw.githubusercontent.com/jeremylongshore/claude-code-plugins-plus-skills/main/plugins/devops/auto-scaling-configurator/skills/configuring-auto-scaling-policies/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/configuring-auto-scaling-policies/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How configuring-auto-scaling-policies Compares

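| Feature / Agent | configuring-auto-scaling-policies | Standard Approach |
|-----------------|-----------------------------------|-------------------|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |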

Frequently Asked Questions

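What does this skill do?

It configures auto-scaling policies for cloud workloads across AWS Auto Scaling Groups, GCP Managed Instance Groups, Azure VMSS, and Kubernetes Horizontal Pod Autoscaler (HPA), as described in the SKILL.md source on this page.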

Where can I find the source code?

You can find the source code on GitHub in the jeremylongshore/claude-code-plugins-plus-skills repository.

Related Guides

AI Agents for Coding

Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.

Best AI Skills for Claude

Explore the best AI skills for Claude and Claude Code across coding, research, workflow automation, documentation, and agent operations.

ChatGPT vs Claude for Agent Skills

Compare ChatGPT and Claude for AI agent skills across coding, writing, research, and reusable workflow execution.

SKILL.md Source

# Configuring Auto-Scaling Policies

## Overview

Configure auto-scaling policies for cloud workloads across AWS Auto Scaling Groups, GCP Managed Instance Groups, Azure VMSS, and Kubernetes Horizontal Pod Autoscaler (HPA). Generate scaling configurations based on CPU, memory, request rate, or custom metrics with appropriate thresholds, cooldown periods, and scale-in protection.

## Prerequisites

- Cloud provider CLI installed and authenticated (`aws`, `gcloud`, or `az`)
- For Kubernetes HPA: `kubectl` configured with cluster access and metrics-server deployed
- Baseline performance data for the target workload (average CPU, memory, request rate)
- Understanding of traffic patterns (steady, bursty, scheduled)
- IAM permissions to create/modify scaling policies and CloudWatch/Stackdriver alarms
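
The baseline data listed above can be condensed into a few numbers before any thresholds are chosen. The following is a minimal sketch, assuming CPU utilization samples (in percent) have already been exported from your monitoring system. The `summarize_baseline` helper and the sample values are hypothetical and not part of the skill.

```python
import math
import statistics


def summarize_baseline(samples):
    """Return the mean, nearest-rank p95, and peak of utilization samples."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    # Nearest-rank percentile: the smallest sample with at least
    # 95% of all samples at or below it.
    rank = math.ceil(0.95 * len(ordered))
    return {
        "mean": statistics.fmean(ordered),
        "p95": ordered[rank - 1],
        "peak": ordered[-1],
    }


# Hypothetical CPU utilization samples in percent (assumption, not real data).
cpu_samples = [22, 25, 31, 28, 40, 55, 62, 35, 30, 27]
baseline = summarize_baseline(cpu_samples)
print(baseline)
```

A large gap between the mean and the p95 usually points to bursty traffic, which may call for a lower scale-out threshold or a higher minimum size.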

## Instructions

1. Identify the scaling target: EC2 Auto Scaling Group, GCP MIG, Azure VMSS, or Kubernetes Deployment
2. Analyze current workload metrics to establish baseline utilization and peak patterns
3. Define scaling boundaries: minimum instances/pods, maximum instances/pods, desired count
4. Select scaling metric(s): CPU utilization, memory, request count, queue depth, or custom metrics
5. Set target thresholds: scale-out trigger (e.g., CPU > 70%), scale-in trigger (e.g., CPU < 30%)
6. Configure cooldown periods to prevent flapping (typically 300s scale-out, 600s scale-in)
7. Add scale-in protection for stateful workloads or leader nodes if needed
8. Generate the scaling policy configuration in the appropriate format (Terraform, YAML, or CLI commands)
9. Validate by simulating load and confirming scaling events fire correctly
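
Steps 3 through 6 above can be sketched as a small decision function. This is an illustrative model only, not how any cloud provider implements scaling. The `ScalingPolicy` class, the `decide` function, the one-instance step size, and the min/max values are assumptions. The thresholds and cooldowns reuse the example figures from steps 5 and 6.

```python
from dataclasses import dataclass


@dataclass
class ScalingPolicy:
    min_size: int = 2               # assumed boundary (step 3)
    max_size: int = 20              # assumed boundary (step 3)
    scale_out_above: float = 70.0   # percent CPU (step 5)
    scale_in_below: float = 30.0    # percent CPU (step 5)
    scale_out_cooldown: int = 300   # seconds (step 6)
    scale_in_cooldown: int = 600    # seconds (step 6)


def decide(policy, current_size, cpu_percent, seconds_since_last_action):
    """Return the desired size after applying thresholds, cooldowns, and bounds."""
    desired = current_size
    if cpu_percent > policy.scale_out_above:
        # Scale out by one instance once the scale-out cooldown has elapsed.
        if seconds_since_last_action >= policy.scale_out_cooldown:
            desired = current_size + 1
    elif cpu_percent < policy.scale_in_below:
        # Scale in by one instance once the longer scale-in cooldown has elapsed.
        if seconds_since_last_action >= policy.scale_in_cooldown:
            desired = current_size - 1
    # Clamp to the configured boundaries.
    return max(policy.min_size, min(policy.max_size, desired))


policy = ScalingPolicy()
print(decide(policy, 4, 85.0, 400))  # high CPU, scale-out cooldown elapsed
print(decide(policy, 4, 20.0, 400))  # low CPU, scale-in cooldown still active
```

The longer scale-in cooldown makes the policy quick to add capacity and slow to remove it, which is the usual bias for user-facing workloads.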

## Output

- Terraform HCL for AWS ASG scaling policies with CloudWatch alarms
- Kubernetes HPA manifests (YAML) with resource or custom metric targets
- GCP autoscaler configurations for Managed Instance Groups
- Scaling policy JSON/YAML for Azure VMSS
- CloudWatch or Stackdriver alarm definitions tied to scaling actions
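
As an illustration of the Kubernetes HPA output listed above, the sketch below builds an `autoscaling/v2` manifest as a plain dictionary and serializes it to JSON, which `kubectl apply -f` accepts alongside YAML. The `build_hpa_manifest` helper, the `web` deployment name, the `-hpa` name suffix, and the numeric values are hypothetical. The skill itself may generate a different layout.

```python
import json


def build_hpa_manifest(deployment, min_replicas, max_replicas,
                       cpu_target_percent, scale_down_window_seconds=600):
    """Build an autoscaling/v2 HorizontalPodAutoscaler manifest as a dict."""
    if not 1 <= min_replicas <= max_replicas:
        raise ValueError("require 1 <= min_replicas <= max_replicas")
    return {
        "apiVersion": "autoscaling/v2",
        "kind": "HorizontalPodAutoscaler",
        "metadata": {"name": f"{deployment}-hpa"},  # naming is an assumption
        "spec": {
            "scaleTargetRef": {
                "apiVersion": "apps/v1",
                "kind": "Deployment",
                "name": deployment,
            },
            "minReplicas": min_replicas,
            "maxReplicas": max_replicas,
            "metrics": [{
                "type": "Resource",
                "resource": {
                    "name": "cpu",
                    "target": {
                        "type": "Utilization",
                        "averageUtilization": cpu_target_percent,
                    },
                },
            }],
            # Slow down scale-in to reduce flapping.
            "behavior": {
                "scaleDown": {
                    "stabilizationWindowSeconds": scale_down_window_seconds,
                },
            },
        },
    }


# Hypothetical deployment name and limits.
manifest = build_hpa_manifest("web", 3, 50, 70)
manifest_json = json.dumps(manifest, indent=2)
print(manifest_json)
```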

## Error Handling

| Error | Cause | Solution |
|-------|-------|---------|
| `No scaling activity despite high load` | Metric not reaching threshold or cooldown active | Verify metric source in CloudWatch/Stackdriver; review recent scaling activities with `aws autoscaling describe-scaling-activities` to see whether a cooldown is still in effect |
| `Scaling too aggressively (flapping)` | Cooldown too short or threshold too sensitive | Increase cooldown period and widen the gap between scale-out and scale-in thresholds |
| `Max capacity reached` | Instance/pod limit hit during traffic spike | Raise `max_size` or implement request queuing as a backpressure mechanism |
| `HPA unable to compute replica count` | Metrics server not deployed or metric unavailable | Install metrics-server and verify `kubectl top pods` returns data |
| `FailedScaleUp: insufficient capacity` | Cloud provider out of capacity in selected AZ/region | Add multiple AZs to the ASG or use mixed instance types with allocation strategy |
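
The flapping row above recommends widening the gap between the scale-out and scale-in thresholds. The sketch below shows why, using a deliberately simplified model that counts how often the scaling direction reverses. The `count_flaps` helper and the CPU readings are hypothetical. The model ignores cooldowns and the effect that scaling itself has on utilization.

```python
def count_flaps(cpu_series, scale_out_above, scale_in_below):
    """Count direction reversals (out -> in or in -> out) for a threshold pair."""
    last_direction = None
    flaps = 0
    for cpu in cpu_series:
        if cpu > scale_out_above:
            direction = "out"
        elif cpu < scale_in_below:
            direction = "in"
        else:
            continue  # inside the dead band: no scaling action
        if last_direction is not None and direction != last_direction:
            flaps += 1
        last_direction = direction
    return flaps


# Hypothetical CPU readings (percent) oscillating around 50%.
cpu_series = [48, 58, 42, 57, 44, 59, 41, 56]

narrow = count_flaps(cpu_series, scale_out_above=55, scale_in_below=45)
wide = count_flaps(cpu_series, scale_out_above=70, scale_in_below=30)
print(narrow, wide)
```

With the narrow 45/55 band every swing crosses a threshold, while the 30/70 band absorbs the same oscillation without any action.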

## Examples

- "Configure an AWS ASG with target tracking at 65% CPU, min 2 / max 20 instances, and 5-minute cooldown."
- "Create a Kubernetes HPA for a deployment that scales from 3 to 50 pods based on requests-per-second using a custom Prometheus metric."
- "Set up scheduled scaling for a GCP MIG: scale to 10 instances at 8am UTC and back to 2 at 10pm."
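
For the second example above, it helps to know how HPA turns a metric reading into a replica count. The Kubernetes documentation gives the core rule as `desiredReplicas = ceil(currentReplicas * currentMetricValue / desiredMetricValue)`, skipped when the ratio is within a tolerance of 1.0 (0.1 by default). The sketch below is a simplified version of that rule. The `hpa_desired_replicas` function name and the requests-per-second figures are assumptions, and handling of unready pods, missing metrics, and stabilization windows is omitted.

```python
import math


def hpa_desired_replicas(current_replicas, current_metric, target_metric,
                         min_replicas, max_replicas, tolerance=0.1):
    """Simplified HPA rule: scale proportionally to metric/target, then clamp."""
    ratio = current_metric / target_metric
    if abs(ratio - 1.0) <= tolerance:
        # Close enough to the target: leave the replica count unchanged.
        desired = current_replicas
    else:
        desired = math.ceil(current_replicas * current_metric / target_metric)
    return max(min_replicas, min(max_replicas, desired))


# Hypothetical readings: average requests per second per pod, target 100.
print(hpa_desired_replicas(10, 180, 100, 3, 50))  # load well above target
print(hpa_desired_replicas(40, 150, 100, 3, 50))  # would exceed maxReplicas
print(hpa_desired_replicas(10, 105, 100, 3, 50))  # within tolerance
```

Because the result is capped at `maxReplicas`, a sustained spike beyond the cap surfaces as the `Max capacity reached` condition described under Error Handling.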

## Resources

- AWS Auto Scaling: https://docs.aws.amazon.com/autoscaling/ec2/userguide/
- Kubernetes HPA: https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale/
- GCP Autoscaler: https://cloud.google.com/compute/docs/autoscaler
- Azure VMSS Autoscale: https://learn.microsoft.com/en-us/azure/virtual-machine-scale-sets/virtual-machine-scale-sets-autoscale-overview

Related Skills

automating-api-testing

1868

from jeremylongshore/claude-code-plugins-plus-skills

Automate API endpoint testing, including request generation, validation, and comprehensive test coverage for REST and GraphQL APIs. Use when testing API contracts, validating OpenAPI specifications, or ensuring endpoint reliability. Trigger with phrases like "test the API", "generate API tests", or "validate API contracts".

windsurf-release-automation

1868

from jeremylongshore/claude-code-plugins-plus-skills

Automate release processes with semantic versioning. Activate when users mention "release automation", "version bump", "changelog generation", "semantic release", or "publish release". Handles release engineering automation. Use when working with Windsurf release automation functionality. Trigger with phrases like "windsurf release automation", "windsurf automation", "windsurf".

windsurf-flows-automation

1868

from jeremylongshore/claude-code-plugins-plus-skills

Create and manage Windsurf Flows for repetitive tasks. Activate when users mention "windsurf flows", "task automation", "workflow automation", "repetitive tasks", or "process automation". Handles Flow creation and management. Use when working with windsurf flows automation functionality. Trigger with phrases like "windsurf flows automation", "windsurf automation", "windsurf".

managing-autonomous-development

1868

from jeremylongshore/claude-code-plugins-plus-skills

Enables an AI assistant to manage sugar's autonomous development workflows. It allows the AI assistant to create tasks, view the status of the system, review pending tasks, and start autonomous execution mode. Use this skill when the user asks to create a new develo... Use when appropriate context is detected. Trigger with relevant phrases based on skill purpose.

configuring-service-meshes

1868

from jeremylongshore/claude-code-plugins-plus-skills

Configures service meshes like Istio and Linkerd for microservices. It generates production-ready configurations, implements best practices, and ensures a security-first approach. Use this skill when the user asks to "configure service ... Use when appropriate context is detected. Trigger with relevant phrases based on skill purpose.

managing-network-policies

1868

from jeremylongshore/claude-code-plugins-plus-skills

Use when managing Kubernetes network policies and firewall rules. Trigger with phrases like "create network policy", "configure firewall rules", "restrict pod communication", or "setup ingress/egress rules". Generates Kubernetes NetworkPolicy manifests following least privilege and zero-trust principles.

configuring-load-balancers

1868

from jeremylongshore/claude-code-plugins-plus-skills

Use when configuring load balancers including ALB, NLB, Nginx, and HAProxy. Trigger with phrases like "configure load balancer", "create ALB", "setup nginx load balancing", or "haproxy configuration". Generates production-ready configurations with health checks, SSL termination, sticky sessions, and traffic distribution rules.

automating-database-backups

1868

from jeremylongshore/claude-code-plugins-plus-skills

Automate database backup processes with scheduling, compression, and encryption. Supports PostgreSQL (pg_dump), MySQL (mysqldump), MongoDB (mongodump), and SQLite. Generates production-ready backup scripts with retention policies and restore procedures. Trigger: "automate database backups", "schedule backups", "create backup script", "disaster recovery". Use when working with automating database backups. Trigger with 'automating', 'database', 'backups'.

preprocessing-data-with-automated-pipelines

1868

from jeremylongshore/claude-code-plugins-plus-skills

Automate data cleaning, transformation, and validation for ML tasks. Use when requesting "preprocess data", "clean data", "ETL pipeline", or "data transformation". Trigger with relevant phrases based on skill purpose.

building-automl-pipelines

1868

from jeremylongshore/claude-code-plugins-plus-skills

Build automated machine learning pipelines with feature engineering, model selection, and hyperparameter tuning. Use when automating ML workflows from data preparation through model deployment. Trigger with phrases like "build automl pipeline", "automate ml workflow", or "create automated training pipeline".

google-sheets-automation

1868

from jeremylongshore/claude-code-plugins-plus-skills

Google Sheets Automation - Auto-activating skill for Business Automation. Triggers on: google sheets automation. Part of the Business Automation skill category.

automating-mobile-app-testing

1868

from jeremylongshore/claude-code-plugins-plus-skills

This skill enables automated testing of mobile applications on iOS and Android platforms using frameworks like Appium, Detox, XCUITest, and Espresso. It generates end-to-end tests, sets up page object models, and handles platform-specific elements. Use this skill when the user requests mobile app testing, test automation for iOS or Android, or needs assistance with setting up device farms and simulators. The skill is triggered by terms like "mobile testing", "appium", "detox", "xcuitest", "espresso", "android test", "ios test".