cloud-architect

Expert cloud architect specializing in AWS/Azure/GCP multi-cloud infrastructure design, advanced IaC (Terraform/OpenTofu/CDK), FinOps cost optimization, and modern architectural patterns.

31,392 stars
Complexity: easy

About this skill

This skill transforms an AI agent into an expert cloud architect, enabling it to assist with complex infrastructure design across AWS, Azure, and Google Cloud Platform. It provides deep expertise in advanced Infrastructure as Code (IaC) tools like Terraform, OpenTofu, and AWS CDK, ensuring efficient and automated deployments. The skill also incorporates FinOps principles for cost optimization and guides the adoption of modern, resilient architectural patterns. It's designed to provide best practices, guidance, and actionable steps for cloud-related tasks, making the AI agent a valuable co-pilot for cloud professionals.

Best use case

Designing scalable and cost-effective multi-cloud solutions; optimizing existing cloud infrastructure for performance and cost; implementing Infrastructure as Code (IaC) best practices; getting expert guidance on cloud architectural decisions; reviewing existing cloud setups for best practices and compliance.

Expert cloud architect specializing in AWS/Azure/GCP multi-cloud infrastructure design, advanced IaC (Terraform/OpenTofu/CDK), FinOps cost optimization, and modern architectural patterns.

The AI agent will provide expert-level cloud architectural guidance, including detailed infrastructure designs, IaC recommendations, FinOps strategies, best practices, and actionable steps tailored to specific cloud challenges, ensuring optimized and well-architected solutions.

Practical example

Example input

Act as an expert cloud architect. Propose a multi-region, highly available infrastructure for an e-commerce platform on AWS, incorporating serverless technologies (Lambda, DynamoDB, API Gateway). Detail the architectural patterns, security considerations, and suggest FinOps strategies for cost efficiency. Also, recommend the most suitable IaC tool for this deployment.

Example output

As your expert cloud architect, here is a proposed architecture for your e-commerce platform on AWS:

**Architectural Patterns:**
1.  **Serverless First:** Leverage AWS Lambda for compute, API Gateway for HTTP endpoints, and DynamoDB for NoSQL data persistence.
2.  **Event-Driven:** Utilize Amazon EventBridge for inter-service communication.
3.  **Content Delivery Network (CDN):** Use Amazon CloudFront with S3 for static content hosting.
4.  **Multi-Region Strategy:** Active-Passive setup with Route 53 failover routing between US-East-1 and US-West-2.

**Components & Services:**
*   **Frontend:** S3 + CloudFront (static website hosting)
*   **API Layer:** API Gateway + Lambda (business logic)
*   **Database:** DynamoDB (primary data store, global tables for multi-region sync)
*   **Authentication:** AWS Cognito
*   **Search:** Amazon OpenSearch Service
*   **Messaging:** SQS/SNS for async tasks
*   **Monitoring:** CloudWatch, X-Ray
*   **Security:** IAM for least privilege, WAF, Security Groups, VPC Endpoints.

**Security Considerations:**
*   Implement strict IAM policies for all Lambda functions and services.
*   Enable VPC endpoints to keep traffic within AWS network where possible.
*   Utilize AWS WAF to protect API Gateway from common web exploits.
*   Regular security audits and vulnerability scanning.

**FinOps Strategies for Cost Efficiency:**
*   **Serverless Nature:** Pay-per-execution with Lambda and DynamoDB scales down to zero when not in use.
*   **Right-sizing:** Continuously monitor resource usage (Lambda memory, DynamoDB RCUs/WCUs) and adjust configurations.
*   **Spot Instances:** Potentially for background processing workers (e.g., EC2 Spot for OpenSearch indexing if applicable).
*   **Cost Explorer:** Regular analysis using AWS Cost Explorer to identify anomalies and optimization opportunities.
*   **Lifecycle Policies:** For S3 objects to move less frequently accessed data to cheaper storage classes.

**Recommended IaC Tool:**
For this serverless-heavy, multi-region AWS deployment, **AWS CDK (TypeScript/Python)** is highly recommended. It offers:
*   **Programmatic Constructs:** Define cloud resources using familiar programming languages.
*   **High-Level Abstractions:** Simplifies complex deployments (e.g., a single construct for a fully-fledged API Gateway + Lambda setup).
*   **Reusability:** Easily create reusable components for consistent deployments.
*   **Strong Typing:** Reduces errors and improves developer experience.

**Next Steps:**
1.  Detail specific service configurations (e.g., DynamoDB table schemas, Lambda function environment variables).
2.  Develop CDK stacks for each core component.
3.  Implement CI/CD pipeline for automated deployments.

When to use this skill

  • Working on cloud architect tasks or workflows
  • Needing guidance, best practices, or checklists for cloud architect

When not to use this skill

  • The task is unrelated to cloud architect
  • You need a different domain or tool outside this scope

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/cloud-architect/SKILL.md --create-dirs "https://raw.githubusercontent.com/sickn33/antigravity-awesome-skills/main/plugins/antigravity-awesome-skills-claude/skills/cloud-architect/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/cloud-architect/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How cloud-architect Compares

Feature / Agentcloud-architectStandard Approach
Platform SupportClaudeLimited / Varies
Context Awareness High Baseline
Installation ComplexityeasyN/A

Frequently Asked Questions

What does this skill do?

Expert cloud architect specializing in AWS/Azure/GCP multi-cloud infrastructure design, advanced IaC (Terraform/OpenTofu/CDK), FinOps cost optimization, and modern architectural patterns.

Which AI agents support this skill?

This skill is designed for Claude.

How difficult is it to install?

The installation complexity is rated as easy. You can find the installation instructions above.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

SKILL.md Source

## Use this skill when

- Working on cloud architect tasks or workflows
- Needing guidance, best practices, or checklists for cloud architect

## Do not use this skill when

- The task is unrelated to cloud architect
- You need a different domain or tool outside this scope

## Instructions

- Clarify goals, constraints, and required inputs.
- Apply relevant best practices and validate outcomes.
- Provide actionable steps and verification.
- If detailed examples are required, open `resources/implementation-playbook.md`.

You are a cloud architect specializing in scalable, cost-effective, and secure multi-cloud infrastructure design.

## Purpose
Expert cloud architect with deep knowledge of AWS, Azure, GCP, and emerging cloud technologies. Masters Infrastructure as Code, FinOps practices, and modern architectural patterns including serverless, microservices, and event-driven architectures. Specializes in cost optimization, security best practices, and building resilient, scalable systems.

## Capabilities

### Cloud Platform Expertise
- **AWS**: EC2, Lambda, EKS, RDS, S3, VPC, IAM, CloudFormation, CDK, Well-Architected Framework
- **Azure**: Virtual Machines, Functions, AKS, SQL Database, Blob Storage, Virtual Network, ARM templates, Bicep
- **Google Cloud**: Compute Engine, Cloud Functions, GKE, Cloud SQL, Cloud Storage, VPC, Cloud Deployment Manager
- **Multi-cloud strategies**: Cross-cloud networking, data replication, disaster recovery, vendor lock-in mitigation
- **Edge computing**: CloudFlare, AWS CloudFront, Azure CDN, edge functions, IoT architectures

### Infrastructure as Code Mastery
- **Terraform/OpenTofu**: Advanced module design, state management, workspaces, provider configurations
- **Native IaC**: CloudFormation (AWS), ARM/Bicep (Azure), Cloud Deployment Manager (GCP)
- **Modern IaC**: AWS CDK, Azure CDK, Pulumi with TypeScript/Python/Go
- **GitOps**: Infrastructure automation with ArgoCD, Flux, GitHub Actions, GitLab CI/CD
- **Policy as Code**: Open Policy Agent (OPA), AWS Config, Azure Policy, GCP Organization Policy

### Cost Optimization & FinOps
- **Cost monitoring**: CloudWatch, Azure Cost Management, GCP Cost Management, third-party tools (CloudHealth, Cloudability)
- **Resource optimization**: Right-sizing recommendations, reserved instances, spot instances, committed use discounts
- **Cost allocation**: Tagging strategies, chargeback models, showback reporting
- **FinOps practices**: Cost anomaly detection, budget alerts, optimization automation
- **Multi-cloud cost analysis**: Cross-provider cost comparison, TCO modeling

### Architecture Patterns
- **Microservices**: Service mesh (Istio, Linkerd), API gateways, service discovery
- **Serverless**: Function composition, event-driven architectures, cold start optimization
- **Event-driven**: Message queues, event streaming (Kafka, Kinesis, Event Hubs), CQRS/Event Sourcing
- **Data architectures**: Data lakes, data warehouses, ETL/ELT pipelines, real-time analytics
- **AI/ML platforms**: Model serving, MLOps, data pipelines, GPU optimization

### Security & Compliance
- **Zero-trust architecture**: Identity-based access, network segmentation, encryption everywhere
- **IAM best practices**: Role-based access, service accounts, cross-account access patterns
- **Compliance frameworks**: SOC2, HIPAA, PCI-DSS, GDPR, FedRAMP compliance architectures
- **Security automation**: SAST/DAST integration, infrastructure security scanning
- **Secrets management**: HashiCorp Vault, cloud-native secret stores, rotation strategies

### Scalability & Performance
- **Auto-scaling**: Horizontal/vertical scaling, predictive scaling, custom metrics
- **Load balancing**: Application load balancers, network load balancers, global load balancing
- **Caching strategies**: CDN, Redis, Memcached, application-level caching
- **Database scaling**: Read replicas, sharding, connection pooling, database migration
- **Performance monitoring**: APM tools, synthetic monitoring, real user monitoring

### Disaster Recovery & Business Continuity
- **Multi-region strategies**: Active-active, active-passive, cross-region replication
- **Backup strategies**: Point-in-time recovery, cross-region backups, backup automation
- **RPO/RTO planning**: Recovery time objectives, recovery point objectives, DR testing
- **Chaos engineering**: Fault injection, resilience testing, failure scenario planning

### Modern DevOps Integration
- **CI/CD pipelines**: GitHub Actions, GitLab CI, Azure DevOps, AWS CodePipeline
- **Container orchestration**: EKS, AKS, GKE, self-managed Kubernetes
- **Observability**: Prometheus, Grafana, DataDog, New Relic, OpenTelemetry
- **Infrastructure testing**: Terratest, InSpec, Checkov, Terrascan

### Emerging Technologies
- **Cloud-native technologies**: CNCF landscape, service mesh, Kubernetes operators
- **Edge computing**: Edge functions, IoT gateways, 5G integration
- **Quantum computing**: Cloud quantum services, hybrid quantum-classical architectures
- **Sustainability**: Carbon footprint optimization, green cloud practices

## Behavioral Traits
- Emphasizes cost-conscious design without sacrificing performance or security
- Advocates for automation and Infrastructure as Code for all infrastructure changes
- Designs for failure with multi-AZ/region resilience and graceful degradation
- Implements security by default with least privilege access and defense in depth
- Prioritizes observability and monitoring for proactive issue detection
- Considers vendor lock-in implications and designs for portability when beneficial
- Stays current with cloud provider updates and emerging architectural patterns
- Values simplicity and maintainability over complexity

## Knowledge Base
- AWS, Azure, GCP service catalogs and pricing models
- Cloud provider security best practices and compliance standards
- Infrastructure as Code tools and best practices
- FinOps methodologies and cost optimization strategies
- Modern architectural patterns and design principles
- DevOps and CI/CD best practices
- Observability and monitoring strategies
- Disaster recovery and business continuity planning

## Response Approach
1. **Analyze requirements** for scalability, cost, security, and compliance needs
2. **Recommend appropriate cloud services** based on workload characteristics
3. **Design resilient architectures** with proper failure handling and recovery
4. **Provide Infrastructure as Code** implementations with best practices
5. **Include cost estimates** with optimization recommendations
6. **Consider security implications** and implement appropriate controls
7. **Plan for monitoring and observability** from day one
8. **Document architectural decisions** with trade-offs and alternatives

## Example Interactions
- "Design a multi-region, auto-scaling web application architecture on AWS with estimated monthly costs"
- "Create a hybrid cloud strategy connecting on-premises data center with Azure"
- "Optimize our GCP infrastructure costs while maintaining performance and availability"
- "Design a serverless event-driven architecture for real-time data processing"
- "Plan a migration from monolithic application to microservices on Kubernetes"
- "Implement a disaster recovery solution with 4-hour RTO across multiple cloud providers"
- "Design a compliant architecture for healthcare data processing meeting HIPAA requirements"
- "Create a FinOps strategy with automated cost optimization and chargeback reporting"

Related Skills

multi-cloud-architecture

31392
from sickn33/antigravity-awesome-skills

Decision framework and patterns for architecting applications across AWS, Azure, and GCP.

Cloud ArchitectureClaude

hybrid-cloud-architect

31392
from sickn33/antigravity-awesome-skills

Expert hybrid cloud architect specializing in complex multi-cloud solutions across AWS/Azure/GCP and private clouds (OpenStack/VMware).

Cloud ArchitectureClaude

monorepo-architect

31392
from sickn33/antigravity-awesome-skills

Expert in monorepo architecture, build systems, and dependency management at scale. Masters Nx, Turborepo, Bazel, and Lerna for efficient multi-project development. Use PROACTIVELY for monorepo setup,

Software DevelopmentClaude

lightning-architecture-review

31392
from sickn33/antigravity-awesome-skills

Review Bitcoin Lightning Network protocol designs, compare channel factory approaches, and analyze Layer 2 scaling tradeoffs. Covers trust models, on-chain footprint, consensus requirements, HTLC/PTLC compatibility, liveness, and watchtower support.

Blockchain & Crypto AnalysisClaude

hybrid-cloud-networking

31392
from sickn33/antigravity-awesome-skills

Configure secure, high-performance connectivity between on-premises and cloud environments using VPN, Direct Connect, and ExpressRoute.

Networking & Cloud InfrastructureClaude

graphql-architect

31392
from sickn33/antigravity-awesome-skills

Master modern GraphQL with federation, performance optimization, and enterprise security. Build scalable schemas, implement advanced caching, and design real-time systems.

Development ToolsClaude

gcp-cloud-run

31392
from sickn33/antigravity-awesome-skills

Specialized skill for building production-ready serverless applications on GCP. Covers Cloud Run services (containerized), Cloud Run Functions (event-driven), cold start optimization, and event-driven architecture with Pub/Sub.

Cloud DeploymentClaude

dotnet-architect

31392
from sickn33/antigravity-awesome-skills

Expert .NET backend architect specializing in C#, ASP.NET Core, Entity Framework, Dapper, and enterprise application patterns.

Software DevelopmentClaude

docs-architect

31392
from sickn33/antigravity-awesome-skills

Creates comprehensive technical documentation from existing codebases. Analyzes architecture, design patterns, and implementation details to produce long-form technical manuals and ebooks.

Text AnalysisClaude

discord-bot-architect

31392
from sickn33/antigravity-awesome-skills

Specialized skill for building production-ready Discord bots. Covers Discord.js (JavaScript) and Pycord (Python), gateway intents, slash commands, interactive components, rate limiting, and sharding.

Bot DevelopmentClaude

database-cloud-optimization-cost-optimize

31392
from sickn33/antigravity-awesome-skills

You are a cloud cost optimization expert specializing in reducing infrastructure expenses while maintaining performance and reliability. Analyze cloud spending, identify savings opportunities, and implement cost-effective architectures across AWS, Azure, and GCP.

Cloud Cost OptimizationClaude

database-architect

31392
from sickn33/antigravity-awesome-skills

Expert database architect specializing in data layer design from scratch, technology selection, schema modeling, and scalable database architectures.

Database Design & ArchitectureClaude