Cloud Cost Optimization Audit

Analyze cloud infrastructure spend across AWS, Azure, and GCP. Identify waste, rightsizing opportunities, and reserved instance savings.

3,891 stars

Best use case

Cloud Cost Optimization Audit is best used when you need a repeatable AI agent workflow instead of a one-off prompt. It is especially useful for teams working in multi. Analyze cloud infrastructure spend across AWS, Azure, and GCP. Identify waste, rightsizing opportunities, and reserved instance savings.

Analyze cloud infrastructure spend across AWS, Azure, and GCP. Identify waste, rightsizing opportunities, and reserved instance savings.

Users should expect a more consistent workflow output, faster repeated execution, and less time spent rewriting prompts from scratch.

Practical example

Example input

Use the "Cloud Cost Optimization Audit" skill to help with this workflow task. Context: Analyze cloud infrastructure spend across AWS, Azure, and GCP. Identify waste, rightsizing opportunities, and reserved instance savings.

Example output

A structured workflow result with clearer steps, more consistent formatting, and an output that is easier to reuse in the next run.

When to use this skill

  • Use this skill when you want a reusable workflow rather than writing the same prompt again and again.

When not to use this skill

  • Do not use this when you only need a one-off answer and do not need a reusable workflow.
  • Do not use it if you cannot install or maintain the related files, repository context, or supporting tools.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/afrexai-cloud-cost-audit/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/1kalin/afrexai-cloud-cost-audit/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/afrexai-cloud-cost-audit/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How Cloud Cost Optimization Audit Compares

Feature / AgentCloud Cost Optimization AuditStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Analyze cloud infrastructure spend across AWS, Azure, and GCP. Identify waste, rightsizing opportunities, and reserved instance savings.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

SKILL.md Source

# Cloud Cost Optimization Audit

Analyze cloud infrastructure spend across AWS, Azure, and GCP. Identify waste, rightsizing opportunities, and reserved instance savings.

## What This Skill Does

When given cloud spend data (billing exports, cost explorer screenshots, or manual input), this skill:

1. **Categorizes spend** across 8 cost domains (compute, storage, networking, databases, AI/ML, observability, security, licensing)
2. **Identifies waste patterns** using 12 common anti-patterns
3. **Calculates savings** with specific dollar amounts per optimization
4. **Prioritizes actions** by effort vs. impact (quick wins → strategic moves)
5. **Generates executive summary** with 90-day roadmap

## Cost Domains & Benchmarks (2026)

### 1. Compute (typically 40-55% of total)
- **Idle instances**: >30% idle = waste. Benchmark: <10% idle capacity
- **Rightsizing**: 60% of instances are oversized by 1+ size category
- **Spot/preemptible**: Batch workloads not on spot = 60-80% overpay
- **Reserved/savings plans**: On-demand for steady-state = 30-50% overpay
- **Container density**: <40% CPU utilization on nodes = poor bin-packing

### 2. Storage (typically 10-20%)
- **Tiering**: Data not accessed in 90 days still on hot storage = 60-80% overpay
- **Snapshot sprawl**: Orphaned snapshots older than 30 days
- **Duplicate data**: Cross-region replication without business justification
- **Object lifecycle**: No lifecycle policies = guaranteed bloat

### 3. Networking (typically 8-15%)
- **Cross-AZ traffic**: Unnecessary data transfer between zones ($0.01-0.02/GB)
- **NAT gateway abuse**: High-throughput through NAT vs. VPC endpoints
- **CDN miss rate**: >20% miss rate = CDN config issue
- **Egress optimization**: No committed use discounts on egress

### 4. Databases (typically 10-20%)
- **Over-provisioned RDS/Cloud SQL**: Multi-AZ for dev/staging environments
- **Read replica sprawl**: Replicas with <5% query load
- **DynamoDB/Cosmos over-provisioning**: Provisioned capacity 3x+ actual usage
- **License waste**: Commercial DB when open-source works

### 5. AI/ML Infrastructure (growing — 5-25%)
- **GPU idle time**: Training instances running 24/7 for 4hr/day workloads
- **Inference over-provisioning**: GPU instances for CPU-viable inference
- **Model storage**: Old model versions consuming storage
- **API costs**: Frontier model API calls without caching layer

### 6. Observability (typically 3-8%)
- **Log ingestion bloat**: Debug logs in production, duplicate log streams
- **Metric cardinality**: High-cardinality custom metrics ($$$)
- **Trace sampling**: 100% trace sampling when 10% suffices
- **Retention overkill**: 13-month retention for non-compliance data

### 7. Security (typically 2-5%)
- **WAF rule bloat**: Managed rule groups not actively tuned
- **Key management**: KMS keys for non-sensitive data
- **Compliance scanning**: Overlapping tools doing same checks

### 8. Licensing (typically 5-15%)
- **Shelfware**: Paid seats not logged in 60+ days
- **Duplicate tools**: Multiple tools solving same problem
- **Enterprise tiers**: Enterprise features unused, paying enterprise price

## 12 Waste Anti-Patterns

| # | Pattern | Typical Waste | Fix Effort |
|---|---------|--------------|------------|
| 1 | Zombie resources (stopped but attached) | 5-15% of bill | Low |
| 2 | Over-provisioned instances | 15-30% compute | Medium |
| 3 | No reserved capacity strategy | 25-40% compute | Medium |
| 4 | Hot storage hoarding | 40-70% storage | Low |
| 5 | Cross-AZ data transfer abuse | 10-30% network | Medium |
| 6 | Dev/staging mirrors production | 20-40% of envs | Low |
| 7 | Orphaned snapshots/AMIs | 3-8% storage | Low |
| 8 | Log ingestion without sampling | 30-60% observability | Low |
| 9 | GPU instances for CPU workloads | 70-85% compute | Medium |
| 10 | No spot/preemptible for batch | 60-80% batch | Medium |
| 11 | Shelfware licenses | 20-40% licensing | Low |
| 12 | No tagging = no accountability | Unmeasurable | High |

## Savings Estimation Framework

For each finding, calculate:
```
Annual Savings = (Current Cost - Optimized Cost) × 12
Implementation Cost = Engineering Hours × Loaded Rate
ROI = (Annual Savings - Implementation Cost) / Implementation Cost
Payback Period = Implementation Cost / (Annual Savings / 12)
```

### Typical Savings by Company Size
| Company Size | Monthly Cloud Spend | Typical Waste % | Annual Savings |
|-------------|-------------------|----------------|---------------|
| Startup (5-15) | $2K-$15K | 35-50% | $8K-$90K |
| Growth (15-50) | $15K-$80K | 25-40% | $45K-$384K |
| Mid-market (50-200) | $80K-$500K | 20-35% | $192K-$2.1M |
| Enterprise (200+) | $500K-$5M+ | 15-25% | $900K-$15M+ |

## Output Format

Generate a report with:
1. **Executive Summary**: Total spend, waste identified, savings potential, top 3 quick wins
2. **Domain Breakdown**: Spend per domain vs. benchmarks
3. **Findings Table**: Each finding with current cost, optimized cost, savings, effort, priority
4. **90-Day Roadmap**: Week 1-2 quick wins, Week 3-6 medium effort, Week 7-12 strategic
5. **Governance Recommendations**: Tagging strategy, budget alerts, review cadence

## Usage

Provide your cloud billing data in any format:
- AWS Cost Explorer export / Azure Cost Management / GCP Billing
- Monthly bill summary
- Architecture description with approximate sizing
- Or just describe your stack and team size for estimates

The agent will analyze and produce the full optimization report.

---

## Want Industry-Specific Cloud Optimization?

Different industries have different compliance, data residency, and workload patterns that change the optimization calculus entirely.

**Get your industry context pack** — pre-built frameworks for Fintech, Healthcare, Legal, SaaS, Ecommerce, Construction, Real Estate, Recruitment, Manufacturing, and Professional Services.

🛒 Browse packs: https://afrexai-cto.github.io/context-packs/
🧮 Calculate your AI savings: https://afrexai-cto.github.io/ai-revenue-calculator/
🤖 Set up your agent: https://afrexai-cto.github.io/agent-setup/

**Bundle deals:**
- Pick 3 packs: $97
- All 10 packs: $197
- Everything bundle: $247

Related Skills

Payroll Compliance Auditor

3891
from openclaw/skills

Run a full payroll audit in under 10 minutes. Catches the errors that cost companies $845 per violation.

Payroll & HR Compliance

Margin Analysis & Profit Optimization

3891
from openclaw/skills

Analyze gross, operating, and net margins by product line, customer segment, and channel. Identify margin erosion patterns and build pricing power.

Business Analysis

Energy Audit — Commercial Building Assessment

3891
from openclaw/skills

Run a full energy audit for commercial or industrial facilities. Identifies waste, models savings, and generates a prioritized retrofit roadmap with ROI timelines.

Sustainability & Efficiency

Compliance & Audit Readiness Engine

3891
from openclaw/skills

Your AI compliance officer. Guides startups and scale-ups through SOC 2, ISO 27001, GDPR, HIPAA, and PCI DSS — from zero to audit-ready. No consultants needed.

Security

Compliance Audit Generator

3891
from openclaw/skills

Run internal compliance audits against major frameworks without hiring a consultant.

Security

Customer Acquisition Cost (CAC) Optimizer

3891
from openclaw/skills

Analyze, benchmark, and reduce your customer acquisition cost across every channel.

Business Analytics & Strategy

AI Spend Audit

3891
from openclaw/skills

Audit your company's AI spending — find waste, measure ROI, and right-size your tool stack.

AI Safety Audit

3891
from openclaw/skills

Comprehensive AI safety and alignment audit framework for businesses deploying AI agents. Built around the UK AI Security Institute Alignment Project standards (2026), EU AI Act requirements, and NIST AI RMF.

Security

cloudflare-manager

3891
from openclaw/skills

Manage Cloudflare DNS records, Tunnels (cloudflared), and Zero Trust policies. Use for pointing domains, exposing local services via tunnels, and updating ingress rules.

DevOps & Infrastructure

SX-security-audit

3891
from openclaw/skills

全方位安全审计技能。检查文件权限、环境变量、依赖漏洞、配置文件、网络端口、Git 安全、Shell 安全、macOS 安全、密钥检测等。支持 CLI 参数、JSON 输出、配置文件。当用户要求"安全检查"、"漏洞扫描"、"权限检查"、"安全审计"时使用此技能。

Security

Skill Audit 🔍

3891
from openclaw/skills

扫描 OpenClaw skills 中的安全风险,防止供应链攻击。

Security

3d-wordcloud-visualizer

3891
from openclaw/skills

3D 词云可视化工具 - 将对话历史或其他文本数据自动转换为炫酷的 3D 地球词云,支持多格式文件导入(JSON/MD/TXT),自动中文分词和词频统计,生成 TOP30 高频词的 3D 可视化效果

Data Visualization